created at 2022-10-02 20:34:27.287+02:00 [Europe/Amsterdam]
download as xml json

Collection Report

Collection name: Tubingen Archive of Language Resources TALAR

Total Score: 7146.9160 out of 10095.0000

Score percentage: 70.8%

Average Score: 10.62 out of 15.00

Maximal score in collection: 11.36

Minimal score in collection: 9.36


File Section

General information on the number of files and the file size.

Number of files: 673

Total size: 30572194 B

Average size: 45426 B

Minimal file size: 2530 B

Maximal file size: 4814975 B


Header Section

The header section shows information on the profile usage in the collection.
Important note: the score of this section differs from the score of the underlying profile. For more information on scoring have a look at the FAQ , please.

Profiles in Collection
ID Score Count
Total number of profiles: 13
clarin.eu:cr1:p_1290431694579 2.80 1
clarin.eu:cr1:p_1524652309874 2.73 1
clarin.eu:cr1:p_1524652309872 2.73 1
clarin.eu:cr1:p_1527668176128 1.74 2
clarin.eu:cr1:p_1527668176122 1.73 137
clarin.eu:cr1:p_1527668176123 1.72 54
clarin.eu:cr1:p_1527668176126 1.72 123
clarin.eu:cr1:p_1527668176125 1.72 13
clarin.eu:cr1:p_1548239945774 1.72 6
clarin.eu:cr1:p_1527668176124 1.71 12
clarin.eu:cr1:p_1288172614023 1.64 265
clarin.eu:cr1:p_1320657629644 1.51 57
clarin.eu:cr1:p_1288172614026 1.12 1

Facet Section

The facet section shows the facet coverage within the collection. It's quite evident that the facet coverage of a certain CMD file can't be higher than those of the profile it is based on.

name coverage
facet-coverage: 100.0%
languageCode 23.2%
collection 95.1%
resourceClass 40.4%
modality 21.2%
format 0.2%
keywords 8.2%
genre 8.3%
subject 3.3%
country 30.9%
organisation 5.1%
name 80.5%
description 42.5%
license 2.8%
availability 33.0%

ResourceProxy Section

The resource proxy section shows information on the number of resource proxies on the kind (the mime type) of resources. A resource proxy is a link to an external resource, described by the CMD file.

Total number of resource proxies: 22136

Average number of resource proxies: 32.89

Total number of resource proxies with MIME: 22048

Average number of resource proxies with MIME: 32.76

Total number of resource proxies with reference: 22136

Average number of resource proxies with references: 32.89


XML Validation Section

The XML validation section shows the result of a simple validation of each CMD file against its profile.

Number of Records: 673

Number of valid Records: 671

Ratio valid Records: 99.7%

Invalid Records:

File Info Validate
clarin/results/cmdi/Tubingen_Archive_of_Language_Resources_TALAR_/http_127_0_0_1_8080_erdora_rest_DATENZENTRUM_TLT_2013_Deriving_Multi_Headed_Projective_Dep.xml
clarin/results/cmdi/Tubingen_Archive_of_Language_Resources_TALAR_/http_127_0_0_1_8080_erdora_rest_SFB833_A03_TACL_datasets_code.xml

XML Populated Section

The XML populated section shows information on the number of xml elements and the fact if these elements are conatining data.

Total number of XML elements: 406000

Average number of XML elements: 603.27

Total number of simple XML elements: 256871

Average number of simple XML elements: 381.68

Total number of empty XML elements: 48832

Average number of empty XML elements: 72.56

Average rate of populated elements: 81.0%


URL Validation Section

The URL validation section shows information on the number of links and the results of link checking for the links which have been checked so far.

Total number of links: 22136

Average number of links: 32.89

Total number of unique links: 22103

Total number of checked links: 22103

Total number of undetermined links: 0

Average number of unique links: 32.84

Total number of broken links: 6216

Average number of broken links: 9.24

Ratio of valid links: 71.9%

Link Checking Results

Category Count Average Response Duration(ms) Max Response Duration(ms)
Ok 15867 50.91 4,151
Restricted_Access 20 26.15 31
Broken 6216 27.15 1,067