Collection Report

Collection name: Tubingen Archive of Language Resources TALAR

URL: https://curation.clarin.eu/collection/Tubingen_Archive_of_Language_Resources_TALAR_.xml

Total Score: 7685.4706 out of 10050.0000

Score percentage: 76.5%

Average Score: 11.47 out of 15.00

Maximal score in collection: 12.34

Minimal score in collection: 8.84


Creation time: 2021-12-01 21:47:50.177+01:00 [Europe/Vienna]


File Section

Number of files: 670

Total size: 32010710 B

Average size: 47777 B

Minimal file size: 2556 B

Maximal file size: 5006448 B


Header Section

Profiles in Collection
ID Score Count
Total number of profiles: 13
clarin.eu:cr1:p_1527668176128 2.74 2
clarin.eu:cr1:p_1527668176123 2.72 54
clarin.eu:cr1:p_1527668176126 2.72 121
clarin.eu:cr1:p_1527668176125 2.72 13
clarin.eu:cr1:p_1548239945774 2.72 6
clarin.eu:cr1:p_1527668176124 2.71 12
clarin.eu:cr1:p_1288172614023 2.64 265
clarin.eu:cr1:p_1320657629644 2.51 56
clarin.eu:cr1:p_1288172614026 2.12 1
clarin.eu:cr1:p_1290431694579 1.80 1
clarin.eu:cr1:p_1527668176122 1.73 137
clarin.eu:cr1:p_1524652309874 1.73 1
clarin.eu:cr1:p_1524652309872 1.73 1

Facet Section

name coverage
facet-coverage: 100.0%
languageCode 41.0%
collection 95.2%
resourceClass 55.1%
modality 26.6%
format 1.3%
keywords 11.3%
genre 10.0%
subject 4.3%
country 45.2%
organisation 8.1%
name 93.7%
description 54.2%
license 3.4%
availability 42.1%

ResourceProxy Section

Total number of resource proxies: 22474

Average number of resource proxies: 33.54

Total number of resource proxies with MIME: 22386

Average number of resource proxies with MIME: 33.41

Total number of resource proxies with reference: 22474

Average number of resource proxies with references: 33.54


XML Validation Section

Number of Records: 670

Number of valid Records: 668

Ratio valid Records: 99.7%

Invalid Records:

File Info Validate
clarin/results/cmdi/Tubingen_Archive_of_Language_Resources_TALAR_/http_127_0_0_1_8080_erdora_rest_DATENZENTRUM_TLT_2013_Deriving_Multi_Headed_Projective_Dep.xml
clarin/results/cmdi/Tubingen_Archive_of_Language_Resources_TALAR_/http_127_0_0_1_8080_erdora_rest_SFB833_A03_TACL_datasets_code.xml

XML Populated Section

Total number of XML elements: 405896

Average number of XML elements: 605.81

Total number of simple XML elements: 257017

Average number of simple XML elements: 383.61

Total number of empty XML elements: 49160

Average number of empty XML elements: 73.37

Average rate of populated elements: 80.9%


URL Validation Section

Total number of links: 22474

Average number of links: 33.54

Total number of unique links: 22441

Total number of checked links: 20177

Total number of undetermined links: 1

Average number of unique links: 33.49

Total number of broken links: 7529

Average number of broken links: 11.24

Ratio of valid links: 62.7%

Link Checking Results

Category Count Average Response Duration(ms) Max Response Duration(ms)
Broken 7529 30.19 3,097
Ok 12647 56.94 7,441
Undetermined 1 0 0