-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Requests to ERDDAP TableDAP and Desired Response Format #798
Comments
@daltonkell Good description of the situation.
For the case of the 'Single Platform' rule in question, I don't think working with any of ERDDAP's various formats should change the data for a non-compliant dataset such that it could be made to be in compliance. Take a DSG timeSeries dataset with one 'station dimension' (call it In the
The result is that the same value is repeated for each of TimeSeries example: Here's a test dataset I created that also shows this (although in this case the 'station dimension' is 2 because it represents two different sensor depths on the same platform). But the principle is the same, in the http://testing.erddap.axds.co/erddap/tabledap/sun2wave_timeseries_micah.ncHeader In the .ncCFMA output format however it has dimension length of 2 (one for each sensor height): http://testing.erddap.axds.co/erddap/tabledap/sun2wave_timeseries_micah.ncCFMAHeader This dataset therefore has 'station dimension' = 2, and only meets the 'Single Platform' rule in our profile because this is the special exception we're making for timeSeries datsets to accommodate for sensors at different heights on the same platform - the lat/lon values could technically vary as well, but they shouldn't since it's the same physical 'platform'. The htmlTable output from the dataset shows what I mean (note the repeated values for those variables): So, as long as you test either Internally, ERDDAP looks for whichever variable is labeled with So, long way to say, I guess I don't see any other options. Hopefully this helps make the point though. Also, regarding:
Using one of these two types should solve the |
@mwengren This is a great explanation, thank you!
Yes, for sure. This takes the raw OPeNDAP output and will make it into the conventional format we're used to. I'm going to check out how we might facilitate something like this, we may have to get creative. Pinging @benjwadams and @Bobfrat to keep them in the loop. |
Closing this issue as it relates to #801 |
Per the discussion in the ERDDAP Google Group,
https://groups.google.com/d/msg/erddap/kyYF8wcNeME/p5TSngqPAwAJ
The default
ERDDAP
response for a format-agnostic TableDAP request is essentially a raw OPeNDAP sequence. The resulting dataset is one withs.
pre-pended to all the variable names:(dataset used in Google group example, courtesy R. Signell)
This can cause several checks to fail for at least two reasons:
If a variable
s.a
has an attribute which refers to another variable by name,b
, and must useb
in a check,b
no longer exists but instead is replaced bys.b
The dimensions of the OPeNDAP sequence are almost guaranteed to not be CF DSG-compliant
This raises the all-important question: What format should we request when supplying an ERDDAP URL to the Compliance Checker?
ERDDAP can provide the same dataset in 44 different formats. Typically, when running the checker against a dataset, the
.ncCF
(NetCDF Contiguous Ragged Array) data format or.ncCFMA
(NetCDF CF Discrete Sampling Geometries File) format work best, since the binary data is encoded in such a way that the features have a single instance dimension. Other formats, such as the.nc
(plain, table-like NetCDF-3 format) align the variables using different dimensions (this does not alter the geospatial location of the data in any way) which may cause the dataset to fall out of compliance.I am wary to say, "Let's just request the .ncCF or .ncCFMA" because I am not yet convinced that changing the format doesn't equate to changing the data. If this were true, could compliance simply be reached by reshuffling the dimensions? I think it's an important point to discuss.
Furthermore, requesting the
.ncCF
or.ncCFMA
format isn't just as easy as appending the extension to the ERDDAP URL -- the file will probably have to be created locally on each run of the checker. See this preliminary result:The text was updated successfully, but these errors were encountered: