Version: 1.1.7
The sample code in this folder demonstrates how to utilize the Cds Data Views to do some basic data analysis using Python Jupyter Notebook. In order to run this sample, you need to have Python installed.
Developed against Python 3.9.1.
This sample is intended to show you how you can use an Cds Data View to bring your data from Cds into Jupyter and easily into Pandas to do machine learning. Using an Cds Data View allows you to configure the data you want, in the way you want it, and not have to do as much client side processing of the data. This is a basic sample that shows you a possible way to do machine learning and a possible outcome of looking at the data. This is not a guide to Pandas or machine learning.
Wind turbines operate by capturing kinetic enrgy of wind to turn rotor blades that run a generator. Governed by Betz's law they can only capture 59.3% of kinetic enrgy from wind. This is important, because after a certain limit higher winds do not create additional power. At a certain wind speed, the power actually decreases.
Can we predict the power output of our wind turbines based on weather forecast data?
- Clone the GitHub repository
- Install the required modules by running the following command in the terminal :
pip install -r requirements.txt
- If running the tests, install the required modules by running the following command in the terminal :
pip install -r test-requirements.txt
The data we are using is available in our sample-ocs-bulk_upload-dotnet repository SampleCollections folder. The steps to upload this data are included in that folder. This sample also skips over the exercise of creating the data view, this is included in the data that you can upload. To see how to create a Data View programmatically, please look at our other samples here.
This sample data is similar to what you might have from a site where the data has unexpected values and not every turbine behaves exactly the same. The data is only for 1 day. Repeating this exercise but using more days of data would give you a better prediction.
- In the
Jupyter
folder, populate the values ofappsettings.placeholder.json
with your own system configuration. Before editing, rename this file toappsettings.json
. This repository's.gitignore
rules should prevent the file from ever being checked in to any fork or branch, to ensure credentials are not compromised.
For example:
{
"Resource": "https://uswe.datahub.connect.aveva.com",
"ApiVersion": "v1",
"TenantId": "PLACEHOLDER_REPLACE_WITH_TENANT_ID",
"NamespaceId": "PLACEHOLDER_REPLACE_WITH_NAMESPACE_ID",
"CommunityId": null,
"ClientId": "PLACEHOLDER_REPLACE_WITH_APPLICATION_IDENTIFIER",
"ClientSecret": "PLACEHOLDER_REPLACE_WITH_APPLICATION_SECRET"
}
Open a terminal and type in jupyter notebook
. This will open a browser window. Navigate to the cloned repository and open up Wind_Turbine_Cds_Data_Cds_Python_Library.ipynb
. Run the cells one by one and you can see the output in browser itself.
The last cell in the notebook is for running unit tests so that you can test to make sure the whole notebook is working as expected. As it tests the methods defined earlier in the notebook, you need to run the previous cells of the notebook before trying to run the unit tests.
The automated tests assume that the data is already lodaded. The steps to load the data and delete the data are included in the .yml but are commented out to not send and delete large amounts of the same data often.
For the main Cds samples page ReadMe
For the main AVEVA samples page ReadMe