Collection Report

Collection name: COllections de COrpus Oraux Numeriques CoCoON ex CRDO

URL: https://curation.clarin.eu/collection/COllections_de_COrpus_Oraux_Numeriques_CoCoON_ex_CRDO_.xml

Total Score: 194801.6183 out of 251370.0000

Score percentage: 77.5%

Average Score: 11.62 out of 15.00

Maximal score in collection: 11.72

Minimal score in collection: 11.25
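
The summary figures above can be cross-checked with simple arithmetic. The sketch below assumes that the maximum total is the per-record maximum (15.00) multiplied by the number of files reported in the File Section (16758), and that the average score is the total divided by that same count. The variable names are illustrative and are not taken from the curation tool.

```python
# Values copied from the report; record_count is the "Number of files" figure.
total_score = 194801.6183
max_total = 251370.0
max_per_record = 15.00
record_count = 16758

# Assumption: percentage = total / maximum, average = total / record count.
score_percentage = 100 * total_score / max_total
average_score = total_score / record_count

print(f"Score percentage: {score_percentage:.1f}%")
print(f"Average score: {average_score:.2f}")
print(f"Max total matches 15 x records: {max_per_record * record_count == max_total}")
```

Under these assumptions the recomputed values round to the reported 77.5% and 11.62.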


Creation time: 2021-12-01 21:33:30.778+01:00 [Europe/Vienna]


File Section

Number of files: 16758

Total size: 99599732 B

Average size: 5943 B

Minimal file size: 2986 B

Maximal file size: 39001 B


Header Section

Profiles in Collection

Total number of profiles: 1

ID                              Score   Count
clarin.eu:cr1:p_1288172614026   1.12    16758

Facet Section

facet-coverage: 64.3%

Name            Coverage
languageCode    99.3%
collection      100.0%
resourceClass   100.0%
modality        0.0%
format          99.0%
keywords        0.0%
genre           0.0%
subject         96.9%
country         0.0%
organisation    0.0%
name            100.0%
description     72.7%
license         92.0%
availability    92.0%
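
The report does not state how the overall facet-coverage figure is derived from the per-facet values. The sketch below compares two candidate readings: the share of facets with any coverage at all, and the plain mean of the per-facet percentages. Both readings are assumptions made for illustration, not the documented formula of the curation tool.

```python
# Per-facet coverage in percent, copied from the Facet Section.
facet_coverage = {
    "languageCode": 99.3, "collection": 100.0, "resourceClass": 100.0,
    "modality": 0.0, "format": 99.0, "keywords": 0.0, "genre": 0.0,
    "subject": 96.9, "country": 0.0, "organisation": 0.0, "name": 100.0,
    "description": 72.7, "license": 92.0, "availability": 92.0,
}

# Reading 1 (assumption): fraction of facets with non-zero coverage.
covered = [name for name, pct in facet_coverage.items() if pct > 0]
share_nonzero = 100 * len(covered) / len(facet_coverage)

# Reading 2 (assumption): arithmetic mean of the per-facet percentages.
mean_coverage = sum(facet_coverage.values()) / len(facet_coverage)

print(f"Facets with non-zero coverage: {len(covered)} of {len(facet_coverage)}")
print(f"Share of covered facets: {share_nonzero:.1f}%")
print(f"Mean per-facet coverage: {mean_coverage:.2f}%")
```

Of the two, only the first reading (9 of 14 facets) reproduces the reported 64.3%; the plain mean comes out lower, at about 60.85%.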

ResourceProxy Section

Total number of resource proxies: 76272

Average number of resource proxies: 4.55

Total number of resource proxies with MIME: 0

Average number of resource proxies with MIME: 0.00

Total number of resource proxies with reference: 76272

Average number of resource proxies with references: 4.55


XML Validation Section

Number of Records: 16758

Number of valid Records: 16758

Ratio valid Records: 100.0%


XML Populated Section

Total number of XML elements: 1079095

Average number of XML elements: 64.39

Total number of simple XML elements: 902275

Average number of simple XML elements: 53.84

Total number of empty XML elements: 94914

Average number of empty XML elements: 5.66

Average rate of populated elements: 89.5%
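
The report does not say which denominator the populated rate uses. The sketch below tests two candidates, simple elements and all elements, against the reported 89.5%. The formula is inferred from the figures and is an assumption, and the variable names are illustrative.

```python
# Totals copied from the XML Populated Section.
total_elements = 1079095
simple_elements = 902275
empty_elements = 94914

# Candidate 1 (assumption): empty elements measured against simple elements.
rate_vs_simple = 100 * (1 - empty_elements / simple_elements)

# Candidate 2 (assumption): empty elements measured against all elements.
rate_vs_total = 100 * (1 - empty_elements / total_elements)

print(f"Populated rate vs simple elements: {rate_vs_simple:.1f}%")
print(f"Populated rate vs all elements: {rate_vs_total:.1f}%")
```

Only the first candidate rounds to 89.5%, which suggests the rate is taken over simple elements. Note that the report labels the figure an average rate, so it may be computed per record and then averaged, which need not equal this ratio of totals exactly.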


URL Validation Section

Total number of links: 76272

Average number of links: 4.55

Total number of unique links: 76271

Total number of checked links: 76271

Total number of undetermined links: 2789

Average number of unique links: 4.55

Total number of broken links: 421

Average number of broken links: 0.03

Ratio of valid links: 99.4%

Link Checking Results

Category                Count   Average Response Duration (ms)   Max Response Duration (ms)
Blocked_By_Robots_txt   116     0                                0
Broken                  421     99.6                             1,704
Ok                      72945   340.8                            9,183
Undetermined            2789    37.63                            177
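
The link counts can be cross-checked against the URL Validation Section. The sketch below assumes that the "Ratio of valid links" treats every checked link not classified as Broken as valid; this is inferred from the figures, not from the tool's documentation, and the variable names are illustrative.

```python
# Category counts copied from the Link Checking Results table.
categories = {
    "Blocked_By_Robots_txt": 116,
    "Broken": 421,
    "Ok": 72945,
    "Undetermined": 2789,
}
checked_links = 76271  # "Total number of checked links"

# The four categories should account for every checked link.
category_total = sum(categories.values())

# Assumption: valid = not Broken. Compare with the stricter "Ok only" share.
valid_ratio = 100 * (1 - categories["Broken"] / checked_links)
ok_share = 100 * categories["Ok"] / checked_links

print(f"Category total equals checked links: {category_total == checked_links}")
print(f"Ratio of non-broken links: {valid_ratio:.1f}%")
print(f"Share of links with status Ok: {ok_share:.1f}%")
```

Under this assumption the non-broken ratio rounds to the reported 99.4%, while the share of links that actually returned Ok is lower, at about 95.6%, because undetermined and robots-blocked links are not counted as broken.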