OBIS: Time series data

From ILMS-Wiki
(Difference between revisions)
Jump to: navigation, search
(Define rules for interpolation methods)
(View list of available datasets)
 
(21 intermediate revisions by 2 users not shown)
Line 1: Line 1:
The section time series data is one of the core features of OBIS. OBIS allows the management, visualization and download of time series data.  
+
The section time series data is one of the core features of OBIS. OBIS allows the management, visualization and download of time series data. In this section it will be described how to browse through time series data.
  
 
[[Using_the_Okavango_Basin_Information_System_(OBIS)|[Back to tutorial main page]]]
 
[[Using_the_Okavango_Basin_Information_System_(OBIS)|[Back to tutorial main page]]]
Line 7: Line 7:
 
The section "Time series data" gives first an overview of all datasets. The measured data are directly linked to the stations, where they have been measured. The link "Export CSV" on the bottom of the table gives the possibility to download a list of all records you see in the overview table. The menu on the left side offers different features like the import of data, Jams XML and Indicator calculation. The tutorial will introduce these functions in the section [[OBIS: Edit Time series data|Adding and modifying data - Time series data]]. At this stage you will learn how to browse for data, search data and view the time series data.  
 
The section "Time series data" gives first an overview of all datasets. The measured data are directly linked to the stations, where they have been measured. The link "Export CSV" on the bottom of the table gives the possibility to download a list of all records you see in the overview table. The menu on the left side offers different features like the import of data, Jams XML and Indicator calculation. The tutorial will introduce these functions in the section [[OBIS: Edit Time series data|Adding and modifying data - Time series data]]. At this stage you will learn how to browse for data, search data and view the time series data.  
  
On the screen most important information are listed:
+
On the screen you see the data overview table with the headings:
* '''Dataset title''' consists of measured parameter and station name, e.g. "Climate Benguela"
+
* '''Dataset title''' consists usually of measured parameter and station name, e.g. "Climate Benguela"
 
* '''Start''' of time series (yy-mm-dd)
 
* '''Start''' of time series (yy-mm-dd)
 
* '''End''' of time series (yy-mm-dd)
 
* '''End''' of time series (yy-mm-dd)
 
* '''Time step''', e.g. daily, monthly
 
* '''Time step''', e.g. daily, monthly
* '''TS Measurement + Data Gaps''', e.g. precipitation, discharge
+
* '''TS Measurement + Data Gaps''', shows the measured parameter. In case that more than one paramter is available the number of parameters is given in the column. When you place the mouse cursor on "Details" the list of all measured parameters appears. When you click on "Details" a new table appears, showing the list of parameters and data gaps. Please note: The information data gaps is only available for daily data and hourly data. In case of monthly data, you will get the message 'No gap analysis'.
* '''TS Data''' contains the link to access the data [show]
+
* '''TS Data''' contains the link to access the data [Show]
  
 
[[File:OBIS_Timeseries.png|1000px|OBIS Timeseries]]
 
[[File:OBIS_Timeseries.png|1000px|OBIS Timeseries]]
Line 19: Line 19:
 
== Metadata ==
 
== Metadata ==
  
For showing all information connected to a dataset you need to click on "details" in the row of a dataset you are interested in, e.g. the example "Climate Luanda".  
+
For showing all information connected to a dataset you need to click on "Details" in the row of a dataset you are interested in, e.g. the example "Climate Tsumeb".  
 
First metadata are listed. They are grouped in Title, Time series information, Dataset and Station details.
 
First metadata are listed. They are grouped in Title, Time series information, Dataset and Station details.
  
 
[[File:OBIS_Timeseries_Details.png|400px|OBIS Timeseries]]
 
[[File:OBIS_Timeseries_Details.png|400px|OBIS Timeseries]]
  
If you click in the register "TS Measurements + Data Gaps" time series connected to the station Luanda were displayed. For the example Luanda eight time series are available. There exists one time series for each parameter, e.g. temperature, precipitation, humidity and others. Further you ahve the possibility to show nearby stations. This feature is explained in the section [[#Find nearby stations|Find nearby stations]].
+
If you click in the register "TS Measurements + Data Gaps" time series connected to the station Tsumeb were displayed. For the example Tsumeb eleven time series are available. There exists one time series for each parameter, e.g. temperature, precipitation, humidity and others. You can see the number of data gaps for each parameter. For more information use the link "show" in the column 'Data Gaps' in the row of the parameter of interest. Further you have the possibility to show nearby stations. How to use this feature is explained in the section [[#Find nearby stations|Find nearby stations]].
  
 
[[File:OBIS_Timeseries_TS.png|800px|]]
 
[[File:OBIS_Timeseries_TS.png|800px|]]
  
== View and download time series data ==
+
==View and download time series data==
 
The section '''"overall statistics and actions"''' lists the number of entries in total and the number of entries for each parameter in brackets. You get an overview of the completeness of the time series or number of gaps. Start and end date of the time series is shown. For the export of data you have two possibilities: If you are interested in the entire dataset, use the download link "'''database data'''" and you get a txt-file containing all data and metadata. The example "climate Luanda" file opened in an editor (see figure below), shows you station information and time series information, lists the parameters in their correct order and gives you the monthly data. Using this download link you get the data as they are stored in the database (no default values, no date gaps).
 
The section '''"overall statistics and actions"''' lists the number of entries in total and the number of entries for each parameter in brackets. You get an overview of the completeness of the time series or number of gaps. Start and end date of the time series is shown. For the export of data you have two possibilities: If you are interested in the entire dataset, use the download link "'''database data'''" and you get a txt-file containing all data and metadata. The example "climate Luanda" file opened in an editor (see figure below), shows you station information and time series information, lists the parameters in their correct order and gives you the monthly data. Using this download link you get the data as they are stored in the database (no default values, no date gaps).
 
Using the download link "'''file'''" the original file will be exported as it has been uploaded to the database.
 
Using the download link "'''file'''" the original file will be exported as it has been uploaded to the database.
Line 35: Line 35:
  
 
[[File:OBIS_Timeseries_TSData.png|1000px|]]
 
[[File:OBIS_Timeseries_TSData.png|1000px|]]
 
Visualization of a txt-file in PSPad (download via the link "database data")
 
  
[[File:download_database_data.png|1000px|]]
+
== Export file==
  
The export delivers a tab stop splitted tet file. The header contains most important metadata information: Name of the RBIS (e.g. Okavango Basin Information System), Date of download, user name, dataset title, station name, coordinates and spatial reference system, elevation, time series ID, metadata ID and column description. The example shows a complete dataset without gaps. If data had or have data gaps you will further get the information of applied interpolation methods, date of interpolation and a declaration of completeness in the txt file.
+
The screenshot below shows the visualization of a txt-file in PSPad. The file has been downloaded via the link "database data".  
  
== Time series data analysis ==
+
The export delivers a tab stop splitted text file. The header contains most important metadata information: name of the RBIS (e.g. Okavango Basin Information System), date of download, user name, dataset title, station name, coordinates and spatial reference system, elevation, time series ID, metadata ID and column description. The example shows a complete dataset without gaps. If data had or have data gaps you will further get the information of applied interpolation methods, date of interpolation and a declaration of completeness in the txt file.
  
=== Data gaps ===
+
[[File:download_database_data.png|1000px|]]
 
+
Data gaps in time series will be automatically identified during the import process (see [[OBIS: Edit Time series data]]).
+
 
+
The percentage next to the number of entries in the first row shows the completeness of the time series for each parameter.
+
In the Interval statistics section you will find below-mentioned to each parameter the number of missing values and the sum of gaps. These values are linked to the full list of data gaps. The color of the following "I" shows if data gaps are already filled (green), partly filled (blue) or not filled (red). To fill data gaps please follow the link '''"I"'''. The [[OBIS:_Time_series_data#Gap_filling_toolbox|Gap filling Toolbox]] will open in a new tab and provide several methods to fill data gaps.
+
 
+
[[File:OBIS_Timeseries_Gaps.png|600px|]]
+
 
+
=== Find nearby stations ===
+
 
+
RBIS allows to search for nearby stations with respect to one previously slected parameter (e.g. precipitation). You can search for neighboring stations by a variety of different attributes: distance (in m), elevation (in m), coefficient of determination (rsquared) and the time interval. The number of results can be limited and ordered either by distance or the coefficient of determination. To simulate the same behavior as during an interpolation process (see [[OBIS:_Time_series_data#Gap_filling_toolbox|Gap filling Toolbox]]) tick on '''"Exclude raw data"'''. This will exclude all time series data marked as "raw data".
+
 
+
'''Notice''': Before you can use the coefficient of determination you have to calculate it. Click the link on the bottom named "update calculation of r²".
+
 
+
The table lists nearby stations, their distance, dataset title, elevation, coefficient of determination (rsquared) and start and end date of time series. If the number of your listed datasets is less than 8, a plot can be generated, where all data gaps are marked in red. This is very helpful to identify possible neighboring stations to fill data gaps.
+
 
+
'''Show gap information''' will generate a list as it would be generated during the interpolation process (see [[OBIS:_Time_series_data#Gap_filling_toolbox|Gap filling Toolbox]]). In comparison to the above list in addition the time interval and unit have to be equal and data values have to be available in respect to each single gap interval.
+
 
+
[[File:OBIS_Timeseries_ShowNearStation.png|800px|]]
+
 
+
== Gap filling toolbox ==
+
 
+
Identified data gaps can be filled either by the internal rule based gap filling toolbox or externally.
+
+
=== Internal interpolation methods===
+
 
+
RBIS includes different internal interpolation methods. The methods are explained in the section "Internal Interpolation Method Set". You can chose between following internal interpolation methods:
+
 
+
* '''''inverse distance weighting'''''
+
 
+
Inverse distance weighted methods are based on the assumption that the interpolating surface should be influenced most by the nearby points and less by the more distant points. The interpolating surface is a weighted average of the scatter points and the weight assigned to each scatter point diminishes as the distance from the interpolation point to the scatter point increases.
+
 
+
Inverse distance weighting (IDW) uses the maximum given number of stations. If this number is not available the number is reduced to the given minimum number of stations.
+
 
+
'''[1] IDW (5-3)''' Inverse distance weighting (IDW) (max 5 stations - min 3 stations are used)
+
 
+
'''[1a] IDW & elevation correction (5-3)''' Inverse distance weighting (IDW) with elevation correction (max 5 stations - min 3 stations are used). Elevation correction is done by adjusting the values of the used stations to the elevation of the source station. These corrected values are used for IDW. If IDW with elevation correction is not possible only IDW is used instead. The default exponent is 2 otherwise it can be changed by the value of the parameter field.
+
 
+
 
+
* '''''linear regression'''''
+
 
+
Linear regression is an approach to modeling the relationship between a scalar dependent variable y and one or more explanatory variables denoted X. The case of one explanatory variable is called simple regression. More than one explanatory variable is multiple regression.
+
 
+
'''[2] linear regression''' with the values of the station with the best r² is used.
+
 
+
'''[2a] linear regression (0.7''') Linear regression with the values of the station with the best r² in a radius of 10 stations is used if r² is at least 0.7 or greater.
+
 
+
'''[2b] linear regression (0.7/30km)''' Linear regression with the values of the station with the best r² in a radius of 10 stations which is at least 0.7 or greater is used. The maximum distance of this station is 30km.
+
 
+
 
+
* '''''linear interpolation'''''
+
 
+
Linear interpolation is a method of curve fitting. If two known points are given (value before and after gap), the linear interpolant is the straight line between these points. The default exponent is 2, otherwise it can be changed by the value of the parameter field.
+
 
+
'''[3] linear interpolation'''
+
 
+
'''[3a] linear int/regr. (l3/0.7)''' Linear interpolation is used if gap length is not more than 3. Gaps greater 3 are closed by linear regression with the values of the station with the best r² in a radius of 10 stations is used if r² is at least 0.7 or greater.
+
 
+
 
+
* '''''nearest neighbor'''''
+
 
+
Nearest-neighbor interpolation (also known as proximal interpolation or, in some contexts, point sampling) is a simple method of multivariate interpolation in one or more dimensions.
+
Interpolation is the problem of approximating the value of a function for a non-given point in some space when given the value of that function in points around (neighboring) that point. The nearest neighbor algorithm selects the value of the nearest point and does not consider the values of neighboring points at all, yielding a piecewise-constant interpolant.
+
 
+
'''[4] nearest neighbour'''
+
 
+
 
+
''screen method selection''
+
 
+
[[File:OBIS_Timeseries_InternalInterpolation.png|600px|]]
+
 
+
=== Define rules for interpolation methods===
+
 
+
The user selects one method or can chose a combination of interpolation methods. In addition the user has the possibility to define rules for the application of the method by clicking on "''Internal Interpolation Method Set"''. A description of the method is given here. With support of selected rules the interpolation method can be adapted. Here values can be set for the parameters: length of data gap until the method shall be applied, maximum and minimum station number, coefficient of determination r2 and maximum distance. If you want to change the default setting, click on the button "Edit".
+
To set a maximum length of data gap is especially recommended for the linear interpolation method as this method is not suitable for larger data gaps.
+
 
+
Interpolation cannot be conducted in case that no matching time series were found or no matching method was found.
+
 
+
''screen internal interpolation method set''
+
 
+
[[File:OBIS_internal_interpolation_method_set.png|1000px|]]
+
 
+
Default values have been set, e.g. maximum number of stations is 5, minimum nuber is 3 and the minimum coefficient for determination was set to 0.7.
+
 
+
The section "Internal Limit values" offers a table containing the value range of all parameters, e.g. humidity: minimum value is 0, maximum value is 100.
+
 
+
===External interpolation===
+
 
+
The section "External Interpolation" offers the possibility to conduct an interpolation out of the OBIS and upload the file into OBIS.
+
If you use an external method you need to chose the method in the selection menue or describe a new method. The import function takes only those values which lie within a previously described data gap, even if the timeseries is longer. The values within the data gap were treated by OBIS like interpolated values. The gap interval can only be filled completely or not. If only one value in the timeseries is missing, it will not be used to close the gap.
+
 
+
Further you need to specify the upload file. You can chose ''add'' to complement the timeseries or ''new'' to add new data after you have previously deleted already exitsing data. This feature allows to close still existing gaps without losing former interpolation results.
+
  
[[File:OBIS_External_Interpolation.png|400px|]]
 
  
 
[[Using_the_Okavango_Basin_Information_System_(OBIS)|[Back to tutorial main page]]]
 
[[Using_the_Okavango_Basin_Information_System_(OBIS)|[Back to tutorial main page]]]

Latest revision as of 13:02, 21 March 2013

The section time series data is one of the core features of OBIS. OBIS allows the management, visualization and download of time series data. In this section it will be described how to browse through time series data.

[Back to tutorial main page]

Contents

View list of available datasets

The section "Time series data" gives first an overview of all datasets. The measured data are directly linked to the stations, where they have been measured. The link "Export CSV" on the bottom of the table gives the possibility to download a list of all records you see in the overview table. The menu on the left side offers different features like the import of data, Jams XML and Indicator calculation. The tutorial will introduce these functions in the section Adding and modifying data - Time series data. At this stage you will learn how to browse for data, search data and view the time series data.

On the screen you see the data overview table with the headings:

  • Dataset title consists usually of measured parameter and station name, e.g. "Climate Benguela"
  • Start of time series (yy-mm-dd)
  • End of time series (yy-mm-dd)
  • Time step, e.g. daily, monthly
  • TS Measurement + Data Gaps, shows the measured parameter. In case that more than one paramter is available the number of parameters is given in the column. When you place the mouse cursor on "Details" the list of all measured parameters appears. When you click on "Details" a new table appears, showing the list of parameters and data gaps. Please note: The information data gaps is only available for daily data and hourly data. In case of monthly data, you will get the message 'No gap analysis'.
  • TS Data contains the link to access the data [Show]

OBIS Timeseries

Metadata

For showing all information connected to a dataset you need to click on "Details" in the row of a dataset you are interested in, e.g. the example "Climate Tsumeb". First metadata are listed. They are grouped in Title, Time series information, Dataset and Station details.

OBIS Timeseries

If you click in the register "TS Measurements + Data Gaps" time series connected to the station Tsumeb were displayed. For the example Tsumeb eleven time series are available. There exists one time series for each parameter, e.g. temperature, precipitation, humidity and others. You can see the number of data gaps for each parameter. For more information use the link "show" in the column 'Data Gaps' in the row of the parameter of interest. Further you have the possibility to show nearby stations. How to use this feature is explained in the section Find nearby stations.

OBIS Timeseries TS.png

View and download time series data

The section "overall statistics and actions" lists the number of entries in total and the number of entries for each parameter in brackets. You get an overview of the completeness of the time series or number of gaps. Start and end date of the time series is shown. For the export of data you have two possibilities: If you are interested in the entire dataset, use the download link "database data" and you get a txt-file containing all data and metadata. The example "climate Luanda" file opened in an editor (see figure below), shows you station information and time series information, lists the parameters in their correct order and gives you the monthly data. Using this download link you get the data as they are stored in the database (no default values, no date gaps). Using the download link "file" the original file will be exported as it has been uploaded to the database.

In the section "Interval statistics and actions" you have the possibility to view the time series data in a plot. You can set a time interval and parameters of interest. Therefore enter the time interval in the entry fields, activate the checkboxes of the parameters of interest and confirm the selection by clicking "set interval & parameter". By clicking on the link "show plot" a graph appears visualizing time series data for the timeframe and parameters selected before. By clicking on the link "database data" you get the time series for this respective timeframe and parameters. You can also chose the function "Export aggregated" for download. This feature allows to export e.g. yearly data, average data, max or min values or sum. Tick the checkboxes of the required output and click on "download".

OBIS Timeseries TSData.png

Export file

The screenshot below shows the visualization of a txt-file in PSPad. The file has been downloaded via the link "database data".

The export delivers a tab stop splitted text file. The header contains most important metadata information: name of the RBIS (e.g. Okavango Basin Information System), date of download, user name, dataset title, station name, coordinates and spatial reference system, elevation, time series ID, metadata ID and column description. The example shows a complete dataset without gaps. If data had or have data gaps you will further get the information of applied interpolation methods, date of interpolation and a declaration of completeness in the txt file.

Download database data.png


[Back to tutorial main page]

Personal tools