-
Notifications
You must be signed in to change notification settings - Fork 39
/
Copy pathmetrics_v0.6.yaml
572 lines (554 loc) · 34.3 KB
/
metrics_v0.6.yaml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
# LIST OF FAIRSFAIR METRICS AND THEIR RESPONSE OUTPUT FORMATS
config:
metric_specification: https://doi.org/10.5281/zenodo.4081213
metric_status: draft
metrics:
## ---------------- FINDABILITY ---------------- ##
- metric_identifier: FsF-F1-01MD
metric_number: 1
metric_short_name: Unique Identifier Metadata and Data
metric_name: Metadata and data are assigned a globally unique identifier.
description: A globally unique identifier may be assigned to a landing page containing metadata, a metadata file or a data file such that it can be referenced unambiguously by humans or machines. Globally unique means an identifier should be associated with only one resource at any time. Examples of unique identifiers are Internationalized Resource Identifier (IRI), Uniform Resource Identifier (URI) such as URL and URN, Digital Object Identifier (DOI), the Handle System, identifiers.org, w3id.org and Archival Resource Key (ARK). A data repository may assign a globally unique identifier to your metadata when you publish and make it available through their services.
fair_principle: F1
target: Metadata and Data
evaluation_mechanism: Identifier is considered unique if it is successfully validated using syntax or scheme for URI, URN, UUID or HASH like identifiers or persistent identifiers.
test_scoring_mechanism: cumulative
metric_tests:
- metric_test_identifier: FsF-F1-01MD-1
metric_test_name: Metadata identifier follows a defined unique identifier syntax or scheme (IRI, URL, UUID, HASH or PID)
metric_test_score: 1
metric_test_maturity: 3
metric_test_requirements:
- expected_values: https://f-uji.net/vocab/identifier/unique
tested_on: https://f-uji.net/vocab/metadata/property/object_identifier
modality: any
comment: identifier can be given as user input
- metric_test_identifier: FsF-F1-01MD-2
metric_test_name: Data identifier follows a defined unique identifier syntax (IRI, URL, UUID, HASH or PID)
metric_test_score: 0
metric_test_maturity: 3
created_by: FAIRsFAIR
date_created: 2024-12-13
date_updated: 2024-12-13
version: 0.6
total_score: 1
- metric_identifier: FsF-F1-02MD
metric_number: 2
metric_short_name: Persistent Identifier Metadata and Data
metric_name: Metadata and data are assigned a persistent identifier.
description: Both, metadata as well as data should be provided with a peristent identifier. We make a distinction between the uniqueness and persistence of an identifier. An HTTP URL (the address of a given unique resource on the web) is globally unique, but may not be persistent as the URL of data may be not accessible (link rot problem) or the data available under the original URL may be changed (content drift problem). Identifiers based on the Handle System, DOI, ARK are both globally unique and persistent. They are maintained and governed such that they remain stable and resolvable for the long term. The persistent identifier (PID) may be resolved (point) to a landing page, metadata or the data content (downloadable artefact), or none if the data or repository is no longer maintained. Therefore, ensuring persistence is a shared responsibility between a PID service provider (e.g., datacite) and its clients (e.g., data repositories). For example, the DOI system guarantees the persistence of its identifiers through its social (e.g., policy) and technical infrastructures, whereas a data provider ensures the availability of the resource (e.g., landing page, downloadable artefact) associated with the identifier.
fair_principle: F1
target: Metadata and Data
evaluation_mechanism: A persistent identifier is considered to be valid if the given identifier complies with a valid PID synthax. To be valid, the PID further has to be registered or maintained by a PID authority which is verified in case a PID is redirected by the PID system regardless of whether this redirection is successful.(see also A1).
test_scoring_mechanism: alternative
metric_tests:
- metric_test_identifier: FsF-F1-02MD-1
metric_test_name: Metadata identifier follows a defined persistent identifier syntax
metric_test_score: 0.5
metric_test_maturity: 1
metric_test_requirements:
- expected_values: https://f-uji.net/vocab/identifier/persistent
modality: any
tested_on: https://f-uji.net/vocab/metadata/property/object_identifier
comment: identifier can be given as user input
- metric_test_identifier: FsF-F1-02MD-2
metric_test_name: Persistent identifier for metadata is registered and maintained by a PID authority
metric_test_requirements:
- expected_values: https://f-uji.net/vocab/identifier/persistent
tested_on: https://f-uji.net/vocab/metadata/property/object_identifier
comment: identifier has to be redirected which verifies it is registered in a PID system
metric_test_score: 0.5
metric_test_maturity: 2
- metric_test_identifier: FsF-F1-02MD-4
metric_test_name: Data identifier follows a defined persistent identifier syntax
metric_test_score: 0
metric_test_maturity: 3
metric_test_requirements:
- expected_values: https://f-uji.net/vocab/identifier/persistent
tested_on: https://f-uji.net/vocab/metadata/property/object_identifier
modality: any
comment: test is not scored
- metric_test_identifier: FsF-F1-02MD-5
metric_test_name: Persistent identifier for data is registered and maintained by a PID authority
metric_test_score: 0
metric_test_maturity: 3
metric_test_requirements:
- expected_values: https://f-uji.net/vocab/identifier/persistent
tested_on: https://f-uji.net/vocab/metadata/property/object_identifier
comment: identifier has to be redirected which verifies it is registered in a PID system; test is not scored
created_by: FAIRsFAIR
date_created: 2024-12-13
date_updated: 2024-12-13
version: 0.6
total_score: 1
- metric_identifier: FsF-F2-01M
metric_number: 5
metric_short_name: Descriptive Core Metadata
metric_name: Metadata includes descriptive core elements (creator, title, data identifier, publisher, publication date, summary and keywords) to support data findability.
description: Metadata is descriptive information about a data object. Since the metadata required differs depending on the users and their applications, this metric focuses on core metadata. The core metadata is the minimum descriptive information required to enable data finding, including citation which makes it easier to find data. We determine the required metadata based on common data citation guidelines (e.g., DataCite, ESIP, and IASSIST), and metadata recommendations for data discovery (e.g., EOSC Datasets Minimum Information (EDMI), DataCite Metadata Schema, W3C Recommendation Data on the Web Best Practices and Data Catalog Vocabulary). This metric focuses on domain-agnostic core metadata. Domain or discipline-specific metadata specifications are covered under metric FsF-R1.3-01M. A repository should adopt a schema that includes properties of core metadata, whereas data authors should take the responsibility of providing core metadata.
fair_principle: F2
target: Metadata
evaluation_mechanism: Metadata can be offered in different ways. here we focus on common web based strategies. These include 1) embedding metadata within the landing page such as JSON-LD, OpenGraph, Microdata, Dublin Core, 2) offering typed links which lead to metadata within the HTML code of the metadata or signposting links. 3) enable content negotiation and deliver e.g. RDF, JSON-LD or XML on demand. The metric evaluates the completeness of metadata in case metadata has been retrieved.
test_scoring_mechanism: cumulative
metric_tests:
- metric_test_identifier: FsF-F2-01M-2
metric_test_name: Core data citation metadata is available
metric_test_score: 0.5
metric_test_maturity: 2
metric_test_requirements:
- expected_values: https://f-uji.net/vocab/metadata/property
modality: all
tested_on: https://f-uji.net/vocab/metadata/property
required:
name:
- creator
- title
- object_identifier
- publication_date
- publisher
- object_type
- metric_test_identifier: FsF-F2-01M-3
metric_test_name: Core descriptive metadata is available
metric_test_score: 1
metric_test_maturity: 3
metric_test_requirements:
- expected_values: https://f-uji.net/vocab/metadata/property
modality: all
tested_on: https://f-uji.net/vocab/metadata/property
required:
name:
- creator
- title
- object_identifier
- publication_date
- publisher
- object_type
- summary
- keywords
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2022-05-30
version: 0.5
total_score: 2
- metric_identifier: FsF-F3-01M
metric_number: 6
metric_short_name: Inclusion of Data Identifier in Metadata
metric_name: Metadata includes the identifier of the data it describes.
description: The metadata should explicitly specify the identifier of the data such that users can discover and access the data through the metadata. If the identifier specified is persistent and points to a landing page, the data identifier and links to download the data content should be taken into account in the assessment.
fair_principle: F3
target: Metadata
evaluation_mechanism: Several metadata standards provide the possibility to include links to the actual data content. The presence of such links is evaluated here.
test_scoring_mechanism: cumulative
metric_tests:
- metric_test_identifier: FsF-F3-01M-2
metric_test_name: Metadata contains a PID or URL which indicates the location of the downloadable data content
metric_test_score: 1
metric_test_maturity: 3
metric_test_requirements:
- expected_values: https://f-uji.net/vocab/data/property
expected_property: https://f-uji.net/vocab/metadata/property/object_content_identifier
modality: any
required:
name:
- url
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2022-05-30
version: 0.5
total_score: 1
- metric_identifier: FsF-F4-01M
metric_number: 7
metric_short_name: Searchable Metadata
metric_name: Metadata is offered in such a way that it can be registered or indexed by search engines.
description: Metadata can be available via multiple endpoints. For example, a repository may distribute its metadata via a metadata protocol (e.g. via OAI-PMH) and/or a custom web service, but these may only support certain catalogs. This metric focuses on those methods of making metadata available that are beneficial to as many user groups as possible, i.e. that can be consumed by well-known, large catalogs and search engines such as Google and Bing. Such metadata should be offered according to the requirements of these search engines.
target: Metadata
evaluation_mechanism: The metric is evaluated using the given metadata encodings as well as standards known to support major search engines such as Dublin Core or schema.org or DCAT encoded in microdata, RDFa, embedded JSON-LD or meta tags according to the webmaster guidelines published by e.g. Google or Bing.
test_scoring_mechanism: cumulative
metric_tests:
- metric_test_identifier: FsF-F4-01M-1
metric_test_name: Metadata is given in a way major search engines can ingest it for their catalogues (Dublin Core or schema.org or DCAT encoded in microdata, RDFa, embedded JSON-LD or meta tags see e.g. Google Dataset Search webmaster guidelines)
metric_test_score: 2
metric_test_maturity: 3
metric_test_requirements:
- expected_values: http://f-uji.net/vocab/metadata/standard
modality: any
required:
name:
- dublin-core
- schemaorg
- dcat-data-catalog-vocabulary
- expected_values: http://f-uji.net/vocab/metadata/offering_method
modality: any
required:
name:
- rdfa
- microdata
- meta_tag
- json_in_html
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2022-05-30
version: 0.5
total_score: 2
- metric_identifier: FsF-A1-01M
metric_number: 8
metric_short_name: Data Access Information
metric_name: Metadata contains access level and access conditions of the data.
description: This metric determines if the metadata includes the level of access to the data such as public, embargoed, restricted, or metadata-only access and its access conditions. Both access level and conditions are necessary information to potentially gain access to the data. It is recommended that data should be as open as possible and as closed as necessary. There are no access conditions for public data. Datasets should be released into the public domain (e.g., with an appropriate public-domain-equivalent license such as Creative Commons CC0 licence) and openly accessible without restrictions when possible. Embargoed access refers to data that will be made publicly accessible at a specific date which should be specified in the metadata. For example, a data author may release their data after having published their findings from the data. Therefore, access conditions such as the date the data will be released publically is essential. Restricted access refers to data that can be accessed under certain conditions (e.g. because of commercial, sensitive, or other confidentiality reasons or the data is only accessible via a subscription or a fee). Restricted data may be available to a particular group of users or after permission is granted. For restricted data, the metadata should include the conditions of access to the data such as point of contact or instructions to access the data. Metadata-only access refers to data that is not made publicly available and for which only metadata is publicly available.
fair_principle: A1
target: Metadata
evaluation_mechanism: Metric evaluation is based on the presence of access information in an appropriate metadata element/field.
test_scoring_mechanism: alternative
metric_tests:
- metric_test_identifier: FsF-A1-01M-1
metric_test_name: Information about access restrictions or rights can be identified in metadata
metric_test_score: 1
metric_test_maturity: 3
metric_test_requirements:
- expected_values: http://f-uji.net/vocab/metadata/property/access_level
modality: any
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2020-12-03
version: 0.5
total_score: 1
- metric_identifier: FsF-A1-02MD
metric_number: 9
metric_short_name: Retrievable Metadata and Data
metric_name: Metadata and data are retrievable by their identifier
fair_principle: A1
target: Metadata and Data
description: This metric determines whether data and metadata are accessible via their identifiers, i.e. whether the identifiers resolve to a target that actually contains data or metadata.
evaluation_mechanism: It is checked whether identifiers are actionable and successfully resolve to a target that actually contains data or metadata.
test_scoring_mechanism: cumulative
metric_tests:
- metric_test_identifier: FsF-A1-02MD-1
metric_test_name: Metadata are retrievable via their specified identifier
metric_test_score: 0.5
metric_test_maturity: 3
- metric_test_identifier: FsF-A1-02MD-2
metric_test_name: Data are retrievable via the identifiers given in metadata
metric_test_score: 0.5
metric_test_maturity: 3
created_by: FAIRsFAIR
date_created: 2024-12-13
date_updated: 2024-12-13
version: 0.6
total_score: 1
- metric_identifier: FsF-A1.1-01MD
metric_number: 11
metric_short_name: Standardized Communication Protocol of Metadata and Data
metric_name: A standardized communication protocol is used to access metadata and data.
description: Given an identifier of a dataset, the metadata of the dataset as well as related data should be retrievable using a standard communication protocol such as HTTP, HTTPS, FTP, TFTP, SFTP, FTAM etc. Avoid disseminating data using a proprietary protocol.
fair_principle: A1.1
target: Metadata and Data
evaluation_mechanism: The URI scheme of resources potentially leading to metadata are tested for a standard communication protocol
test_scoring_mechanism: alternative
metric_tests:
- metric_test_identifier: FsF-A1.1-01MD-1
metric_test_name: Identifier leading to metadata matches a scheme indicating a standardized web communication protocol.
metric_test_score: 0.5
metric_test_maturity: 3
- metric_test_identifier: FsF-A1.1-01MD-2
metric_test_name: Identifier leading to data are matching a schema indicating a standardized web communication protocol.
metric_test_score: 0.5
metric_test_maturity: 3
created_by: FAIRsFAIR
date_created: 2024-12-16
date_updated: 2024-12-16
version: 0.6
total_score: 1
- metric_identifier: FsF-A1.2-01MD
metric_number: 12
metric_short_name: Authentication Support of Protocol for Metadata and Data
metric_name: Metadata and data are accessible through a standardized communication protocol which supports authentication.
description: Given an identifier of a dataset, the metadata of the dataset as well as related data should be retrievable using a standard communication protocol which supports authentication such as HTTP, HTTPS, FTPS.
fair_principle: A1.2
target: Metadata and Data
evaluation_mechanism: The URI scheme of the landing page is tested for a standard communication protocol which supports authentication
test_scoring_mechanism: alternative
metric_tests:
- metric_test_identifier: FsF-A1.2-01MD-1
metric_test_name: The communication protocol found in identifiers (IRIs) leading to metadata supports authentication.
metric_test_score: 0.5
metric_test_maturity: 3
- metric_test_identifier: FsF-A1.2-01MD-2
metric_test_name: The communication protocol identified in data links (IRIs) supports authentication.
metric_test_score: 0.5
metric_test_maturity: 3
created_by: FAIRsFAIR
date_created: 2024-12-16
date_updated: 2024-12-16
version: 0.6
total_score: 1
#- metric_identifier: FsF-A2-01M
# metric_number: 9
# metric_short_name: Metadata Preservation
# metric_name: Metadata remains available, even if the data is no longer available.
# description: This metric determines if the metadata will be preserved even when the data they represent are no longer available, replaced or lost.
# fair_principle: A2
# target: Metadata
# evaluation_mechanism: Currently this metric can only be assessed using the persistent identifier as an indicator. DOI metadata is preserved by DataCite.
# metric_tests:
# - metric_test_identifier: FsF-A2-01M-1
# metric_test_name: The persistent identifier system used guarantees the preservation of associated metadata
# metric_test_score: 1
# metric_test_maturity: 3
# created_by: FAIRsFAIR
# date_created: 2020-07-08
# date_updated: 2020-12-05
# version: 0.5
# total_score: 1
- metric_identifier: FsF-I1-01M
metric_number: 13
metric_short_name: Formal Representation of Metadata
metric_name: Metadata is represented using a formal knowledge representation language.
description: Knowledge representation is vital for machine-processing of the knowledge of a domain. Expressing the metadata of a data object using a formal knowledge representation will enable machines to process it in a meaningful way and enable more data exchange possibilities. Examples of knowledge representation languages are RDF, RDFS, and OWL. These languages may be serialized (written) in different formats. For instance, RDF/XML, RDFa, Notation3, Turtle, N-Triples and N-Quads, and JSON-LD are RDF serialization formats.
fair_principle: I1
target: Metadata
evaluation_mechanism: Metadata has to be serialised in a common formal knowledge representation language.
test_scoring_mechanism: cumulative
metric_tests:
- metric_test_identifier: FsF-I1-01M-1
metric_test_name: Parsable, structured metadata (JSON-LD, RDFa) is embedded in the landing page XHTML/HTML code
metric_test_score: 1
metric_test_maturity: 2
metric_test_requirements:
- expected_values: http://f-uji.net/vocab/metadata/format
modality: any
required:
name:
- RDF
- JSON-LD
- RDFa
- expected_values: http://f-uji.net/vocab/metadata/offering_method
modality: any
required:
name:
- meta_tag
- microdata
- rdfa
- json_in_html
- metric_test_identifier: FsF-I1-01M-2
metric_test_name: Parsable, structured metadata (RDF, JSON-LD) is accessible through content negotiation, typed links or sparql endpoint
metric_test_score: 1
metric_test_maturity: 3
metric_test_requirements:
- expected_values: http://f-uji.net/vocab/metadata/format
modality: any
required:
name:
- RDF
- JSON-LD
- RDFa
- expected_values: http://f-uji.net/vocab/metadata/offering_method
modality: any
required:
name:
- content_negotiation
- expected_values: http://f-uji.net/vocab/metadata/exchange_service
modality: any
required:
name:
- sparql
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2023-06-01
version: 0.5
total_score: 2
- metric_identifier: FsF-I2-01M
metric_number: 14
metric_short_name: Metadata with Semantic Resources
metric_name: Metadata uses semantic resources
description: A metadata document or selected parts of the document may incorporate additional terms from semantic resources (also referred as semantic artefacts) so that the contents are unambiguous and can be processed automatically by machines. This enrichment facilitates enhanced data search and interoperability of data from different sources. Ontology, thesaurus, and taxonomy are kinds of semantic resources, and they come with varying degrees of expressiveness and computational complexity. Knowledge organization schemes such as thesaurus and taxonomy are semantically less formal than ontologies.
fair_principle: I2
target: Metadata
evaluation_mechanism: Used namespaces are identified in given graph or XML metadata and verified using a controlled list.
test_scoring_mechanism: cumulative
metric_tests:
- metric_test_identifier: FsF-I2-01M-2
metric_test_name: Metadata uses terms from registered vocabularies that are identified by their namespaces
metric_test_score: 1
metric_test_maturity: 3
metric_test_requirements:
- expected_values: http://f-uji.net/vocab/semantic_resource
modality: any
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2020-12-03
version: 0.5
total_score: 1
- metric_identifier: FsF-I3-01M
metric_number: 15
metric_short_name: Qualified references to related entities
metric_name: Metadata includes qualified references between the data and its related entities.
description: Linking data to its related entities will increase its potential for reuse. The linking information should be captured as part of the metadata. A dataset may be linked to its prior version, related datasets or resources (e.g. publication, physical sample, funder, repository, platform, site, or observing network registries). Links between data and its related entities should be qualified, thus expressed through relation types (e.g., DataCite Metadata Schema specifies relation types between research objects through the fields ‘RelatedIdentifier’ and ‘RelationType’), and preferably use persistent Identifiers for related entities.
fair_principle: I3
target: Metadata
evaluation_mechanism: Metadata is checked for the presence of (typed, default = related) related resource properties containing information about existing relations to related entities which can be e.g. citations or other related resources
metric_tests:
- metric_test_identifier: FsF-I3-01M-1
metric_test_name: Related resources are referenced in plain text within appropriate metadata properties indicating the relation type
metric_test_score: 1
metric_test_maturity: 2
metric_test_requirements:
- expected_values: http://f-uji.net/vocab/relation_type
modality: any
tested_on: http://f-uji.net/vocab/metadata/property/related_resources
comment: The presence of a (typed, default = related) related resource is checked which is expressed as plain text
- metric_test_identifier: FsF-I3-01M-2
metric_test_name: Related resources are referenced by machine readable links or identifiers within appropriate metadata properties indicating the relation type
metric_test_requirements:
- comment: same as above but relations have to be machine readable/actionable
metric_test_score: 1
metric_test_maturity: 3
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2020-12-03
version: 0.5
total_score: 1
- metric_identifier: FsF-R1-01M
metric_number: 16
metric_short_name: Metadata of Data Content
metric_name: Metadata specifies the content of the data.
description: This metric evaluates if a description (properties) of the content of the data is specified in the metadata. The description should be an accurate reflection of the actual data deposited. Data content descriptors include but are not limited to resource type (e.g., data or a collection of data), variable(s) measured or observed, data format and size or data service and protocol. Ideally, ontological vocabularies should be used to describe data content to support interdisciplinary reuse.
fair_principle: R1
target: Metadata, Data
evaluation_mechanism: Metric is evaluated using the resource type given in the metadata as well as data object specific properties file size and file type. Further presence of measured variables is tested.
test_scoring_mechanism: cumulative
metric_tests:
- metric_test_identifier: FsF-R1-01M-1
metric_test_name: Minimum information (resource type) about the available data content is specified in the metadata
metric_test_score: 2
metric_test_maturity: 1
- metric_test_identifier: FsF-R1-01M-2
metric_test_name: Information on the manner and form (file size and type or service (API) endpoint and protocol) in which data is delivered is provided
metric_test_score: 2
metric_test_maturity: 3
- metric_test_identifier: FsF-R1-01M-3
metric_test_name: Measured variables or observation types are specified in metadata
metric_test_requirements:
- comment: test is not scored
metric_test_score: 0
metric_test_maturity: 3
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2020-07-08
version: 0.5
total_score: 4
- metric_identifier: FsF-R1.1-01M
metric_number: 17
metric_short_name: Data Usage License
metric_name: Metadata includes license information under which data can be reused.
description: This metric evaluates if data is associated with a license because otherwise users cannot reuse it in a clear legal context. We encourage the application of licenses for all kinds of data whether public, restricted or for specific users. Without an explicit license, users do not have a clear idea of what can be done with your data. Licenses can be of standard type (Creative Commons, Open Data Commons Open Database License) or bespoke licenses, and rights statements which indicate the conditions under which data can be reused. It is highly recommended to use a standard, machine-readable license such that it can be interpreted by machines and humans. In order to inform users about what rights they have to use a dataset, the license information should be specified as part of the dataset’s metadata.
fair_principle: R1.1
target: Metadata
evaluation_mechanism: Metric evaluation is based on the presence of a machine readable license information in an appropriate metadata element/field.
test_scoring_mechanism: cumulative
metric_tests:
- metric_test_identifier: FsF-R1.1-01M-1
metric_test_name: Licence information is given in an appropriate metadata element
metric_test_score: 2
metric_test_maturity: 3
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2023-06-02
version: 0.5
total_score: 2
- metric_identifier: FsF-R1.2-01M
metric_number: 18
metric_short_name: Data Provenance
metric_name: Metadata includes provenance information about data creation or generation.
description: >-
Data provenance (also known as lineage) represents a dataset’s history, including the people, entities, and processes involved in its creation, management and longer-term curation.
It is essential to provide provenance information about your data to provide valuable context and to enable informed use and reuse.
The levels of provenance information needed can vary depending on the data type (e.g., measurement, observation, derived data, or data product) and research domains.
For that reason, it is difficult to define a set of finite provenance properties that will be adequate for all domains.
Based on existing work, we suggest that the following provenance properties of data generation or collection are included in the metadata record as a minimum.
(a) Sources of data, e.g., datasets and other resources the data is derived from
(b) Data creation or collection date
(c) Contributors involved in data creation and their roles
(d) Data publication, modification and versioning information
There are various ways through which provenance information may be included in a metadata record.
Provenance information can be given in a linked provenance record expressed explicitly in e.g., PROV-O or PAV or Vocabulary of Interlinked Datasets (VoID).
Alternatively suitable metadata properties can be used. For example, Dublin Core has been mapped to PROV in PROV-DC which allows further mappings of other metadata standards to PROV.
fair_principle: R1.2
target: Metadata
evaluation_mechanism: Metrics are assessed using provenance related information contained in metadata which can either be specific elements which can be mapped e.g. to PROV-O or the use of provenance related namespaces and associated terms.
test_scoring_mechanism: alternative
metric_tests:
- metric_test_identifier: FsF-R1.2-01M-1
metric_test_name: Metadata contains elements which hold provenance information and can be mapped to PROV based on PROV-DC
metric_test_score: 2
metric_test_maturity: 2
- metric_test_identifier: FsF-R1.2-01M-2
metric_test_name: Metadata contains provenance information using formal provenance ontologies (PROV, PAV)
metric_test_score: 2
metric_test_maturity: 3
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2023-06-01
version: 0.5
total_score: 2
- metric_identifier: FsF-R1.3-01M
metric_number: 19
metric_short_name: Community-Endorsed Metadata Standard
metric_name: Metadata follows a standard recommended by the target research community of the data.
description: In addition to core metadata required to support data discovery (covered under metric FsF-F2-01M), metadata to support data reusability should be made available following community-endorsed metadata standards. Some communities have well-established metadata standards (e.g., geospatial [ISO19115], biodiversity [DarwinCore, ABCD, EML], social science [DDI], astronomy [International Virtual Observatory Alliance Technical Specifications]) while others have limited standards or standards that are under development (e.g., engineering and linguistics). The use of community-endorsed metadata standards is usually encouraged and supported by domain and discipline-specific repositories.
fair_principle: R1.3
target: Metadata
evaluation_mechanism: Metadata encodings can be verified using community specific namespaces and schemas listed by the RDA metadata standards WG or fairsharing.org
test_scoring_mechanism: alternative
metric_tests:
- metric_test_identifier: FsF-R1.3-01M-1
metric_test_name: Community specific metadata standard is detected using namespaces or schemas found in provided metadata
metric_test_score: 1
metric_test_maturity: 3
metric_test_requirements:
- modality: any except
expected_values: https://f-uji.net/vocab/metadata/standards
required:
field_of_science:
- science
- generic
comment: test performed on namespaces or schemas found in exposed metadata
- metric_test_identifier: FsF-R1.3-01M-3
metric_test_name: Multidisciplinary but community endorsed metadata (RDA Metadata Standards Catalog, fairsharing) standard is detected by namespace
metric_test_score: 1
metric_test_maturity: 1
metric_test_requirements:
- modality: any
expected_values: https://f-uji.net/vocab/metadata/standards
required:
field_of_science:
- science
- generic
source:
- rd-alliance.org
- fairsharing.org
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2020-12-03
version: 0.5
total_score: 1
- metric_identifier: FsF-R1.3-02D
metric_number: 20
metric_short_name: Data File format
metric_name: Data is available in a file format recommended by the target research community.
description: >-
File formats refer to methods for encoding digital information. For example, CSV for tabular data, NetCDF for multidimensional data and GeoTIFF for raster imagery. Data should be made available in a file format that is backed by the research community to enable data sharing and reuse. Consider for example, file formats that are widely used and supported by the most commonly used software and tools. These formats also should be suitable for long-term storage and archiving, which are usually recommended by a data repository. The formats not only give a higher certainty that your data can be read in the future, but they will also help to increase the reusability and interoperability. Using community-endorsed formats enables data to be loaded directly into the software and tools used for data analysis. It makes it possible to easily integrate your data with other data using the same preferred format. The use of preferred formats will also help to transform the format to a newer one, in case a preferred format gets outdated.
fair_principle: R1.3
target: Data
evaluation_mechanism: Data file format given in metadata is compared to a controlled list of known open, long term or scientific formats.
test_scoring_mechanism: alternative
metric_tests:
- metric_test_identifier: FsF-R1.3-02D-1
metric_test_name: Data is available in a file format recommended by the research community (long term file formats, open file formats or scientific file format)
metric_test_score: 1
metric_test_maturity: 3
created_by: FAIRsFAIR
date_created: 2020-07-08
date_updated: 2020-12-03
version: 0.5
total_score: 1
metric_specification: 10.5281/zenodo.6461229