created at 2023-05-03 20:05:55
download as xml json

Collection Report

Collection name: Instituut voor Nederlandse Lexicologie INL Metadata Repository

Section Score Score Percentage
total 1,335.5 / 1,621.8 82.3%
FileReport 113.0 / 114.0 99.1%
ProfileReport 264.4 / 339.0 78.0%
HeaderReport 560.0 / 565.0 99.1%
ResProxyReport 127.3 / 226.0 56.3%
XmlPopulationReport 86.0 / 113.0 76.1%
XmlValidityReport 112.0 / 113.0 99.1%
LinkcheckerReport 37.1 / 38.8 95.6%
FacetReport 35.7 / 113.0 31.6%

The above table is based on 113 processable files. There are also 1 files in the collection that could not be processed. See record details table for more information.

File Section

General information on the number of files and the file size.

Number of files: 114

Number of processable files: 113

Total size: 2,778,436 B

Average size: 24,372 B

Minimal file size: 2,943 B

Maximal file size: 1,526,482 B


Header Section

The header section shows information on the availibilty of attribute schemaLocation as well as the elements MdSelfLink, MdProfile and MdCollectionDisplayName.

Number of files with schemaLocation:113

Number of files where schemaLocation is CR resident: 113

Number of files with MdProfile: 113

Number of files with MdSelfLink: 112

Number of files with MdCollectionDisplayName: 112


Profile Usage Section

The profile usage section shows information shows which profiles are used how oftenly in a collection. collection.

ID Is public Score Count
Total number of profiles: 15
clarin.eu:cr1:p_1271859438164 true 2.72 11
clarin.eu:cr1:p_1290431694581 true 2.69 3
clarin.eu:cr1:p_1381926654456 true 2.68 11
clarin.eu:cr1:p_1371047304769 true 2.67 1
clarin.eu:cr1:p_1361876010608 true 2.66 4
clarin.eu:cr1:p_1387365569700 true 2.65 13
clarin.eu:cr1:p_1380106710823 true 2.65 2
clarin.eu:cr1:p_1380106710825 true 2.60 3
clarin.eu:cr1:p_1361876010587 true 2.59 21
clarin.eu:cr1:p_1272022528363 true 2.55 7
clarin.eu:cr1:p_1271859438162 true 2.42 2
clarin.eu:cr1:p_1320657629667 true 1.93 1
clarin.eu:cr1:p_1527668176123 false 1.72 3
clarin.eu:cr1:p_1524652309875 false 1.72 10
clarin.eu:cr1:p_1299509410083 false 1.65 21

Facet Section

The facet section shows the facet coverage within the collection. It's quite evident that the facet coverage of a certain CMD file can't be higher than those of the profile it is based on.

name coverage
average facet-coverage: 31.6%
languageCode 25.4%
collection 98.2%
resourceClass 43.0%
modality 48.2%
format 18.4%
keywords 0.0%
genre 9.6%
subject 6.1%
country 36.0%
organisation 31.6%
name 52.6%
description 44.7%
license 3.5%
availability 21.1%

ResourceProxy Section

The resource proxy section shows information on the number of resource proxies on the kind (the mime type) of resources. A resource proxy is a link to an external resource, described by the CMD file.

Total number of resource proxies: 153

Average number of resource proxies: 1.34

Total number of resource proxies with MIME: 31

Average number of resource proxies with MIME: 0.27

Total number of resource proxies with reference: 153

Average number of resource proxies with references: 1.34


XML Validation Section

The XML validation section shows the result of a simple validation of each CMD file against its profile.

Number of XML valid Records: 112

Ratio XML valid Records: 99.1%


XML Populated Section

The XML populated section shows information on the number of xml elements and the fact if these elements are conatining data.

Total number of XML elements: 16,188

Average number of XML elements: 142.00

Total number of simple XML elements: 10,301

Average number of simple XML elements: 90.36

Total number of empty XML elements: 2,946

Average number of empty XML elements: NaN

Average rate of populated elements: 71.4%


URL Validation Section

The URL validation section shows information on the number of links and the results of link checking for the links which have been checked so far.

Total number of links: 265

Average number of links: 2.32

Total number of unique links: 265

Average number of unique links: 2.32

Total number of checked links: 91

Ratio of valid links: 95.6%

Link Checking Results

Category Count Average Response Duration(ms) Max Response Duration(ms)
Ok 87 708.6 4,150.0
Broken 2 3,275.0 3,275.0
Invalid_URL 2 NaN NaN


Record details:

The record details section shows the particalarities of each record as far as they're of importance for the data provider.

File Info Validate
clarin/results/cmdi/Instituut_voor_Nederlandse_Lexicologie_INL_Metadata_Repository/CorpusGysseling.xml
clarin/results/cmdi/Instituut_voor_Nederlandse_Lexicologie_INL_Metadata_Repository/306dfa96c48e12b91c33d50aea383b76.xml
clarin/results/cmdi/Instituut_voor_Nederlandse_Lexicologie_INL_Metadata_Repository/clarin_center_ivdnt.xml