created at 2022-06-28 20:00:12.746+02:00 [Europe/Amsterdam]
download as xml json

CMD Profile Report

Name media-corpus-profile
Description General purpose metadata description of a collection of media files (= recordings) and annotation files and optional documentation. This profile is typically (but not necessarily) used in conjuntion with a collection of CMDIs that encode the individual recording sessions belonging to the corpus (e.g. media-session-profile); pointers to these session CMDIs should be listed as links of type 'Metadata' in the ResourceProxyList of the profile. Version 1.1 : extended by components MultimodalCorpus and AnnotationInfo defined by CLARIN-D F-AG 6 to improve encoding of multimodal data; component cmdi-speech-corpus is now optional; it is recommended to use either the component cmdi-speech-corpus for classical spoken language corpora, or the component MultimodalCorpus for corpora of multimodal data; the (optional) component AnnotationInfo may be used for homogenous corpora with identical annotations in all recording sessions, otherwise use the corresponding components in the session CMDI records (e.g. media-session-profile).
Schema Location
CMDI Version 1.2
Status production

Score Section

The scoring is based on public state of the profile, the percentage of elements (except header and resources) with concept and percentage of defined facets covered by the profile.
For details on scoring, have a look at the FAQ , please.

Segment Score Max
Total: 2.86 Max: 3.00
header-section 1.00 1.00
cmd-concepts-section 0.86 1.00
facets-section 1.00 1.00

Facets Section

The facet section shows if a specific facet is covered by the profile. In other words, if the profile defines an element for the facet.

Name Covered
Covered: 14 / 14 Coverage: 100.0%
languageCode true
collection true
resourceClass true
modality true
format true
keywords true
genre true
subject true
country true
organisation true
name true
description true
license true
availability true

Usage Section

The usage section shows in which collection the profile is used

Collection Usage
BAS_Repository 50

Cmd Component Section

The components section shows information on the kind, id and the usage of concepts in the profile.
For more information on componets, have a look at the Component Registry Documentation , please.

Name Id Count
Total: 56 Unique: 37 Required: 18
Collection 1
GeneralInfo 1
Description 11
Location 1
Country 1
Continent 1
Project 1
Contact 3
Duration 1
Creators 1
Creator 1
DocumentationLanguages 1
Language 3
ISO639 3
Access 1
Price 1
CollectionType 1
Corpus 1
Multilinguality 1
AnnotationTypes 2
AnnotationType 2
Size 1
TotalSize 1
SizePerLanguage 1
SubjectLanguages 1
SubjectLanguage 1
Modality 1
Validation 1
SpeechCorpus 1
SpeechTechnicalMetadata 1
MimeType 1
MultimodalCorpus 1
ModalityInfo 1
Descriptions 2
AnnotationInfo 1
AnnotationFormat 1
AnnotationToolInfo 1

Cmd Concepts Section

The concepts section shows information on the kind, state and the number of concepts used in the profile.
For more information on concepts, have a look at the CLARIN Concept Registry , please.

Total number of elements: 125

Number of required elements: 36

Number of elements with specified concept: 108

Percentage of elements with specified concept: 86.4%

Concept Status Count
Total: 108 Unique: 67 Required: 36
language ID APPROVED 4
resource name APPROVED 1
resource title APPROVED 1
persistent identifier APPROVED 1
version APPROVED 2
Legal Owner CANDIDATE 1
publication date APPROVED 1
description APPROVED 13
location address APPROVED 1
location region APPROVED 1
location country APPROVED 1
location continent APPROVED 1
project name APPROVED 1
project title APPROVED 1
project id APPROVED 1
funder APPROVED 1
address APPROVED 3
email APPROVED 3
Organisation CANDIDATE 3
telephone number APPROVED 3
start year APPROVED 1
completion year APPROVED 1
creator role APPROVED 1
language name APPROVED 3
availability APPROVED 1
Distribution medium CANDIDATE 1
Catalogue link CANDIDATE 1
price APPROVED 1
resource class CANDIDATE 1
topic APPROVED 1
number of languages APPROVED 1
annotation level type APPROVED 2
size unit APPROVED 2
dominant language APPROVED 1
source language APPROVED 1
target language APPROVED 1
modalities APPROVED 2
validation APPROVED 1
validation type APPROVED 1
validation mode APPROVED 1
validation level APPROVED 1
duration of effective speech APPROVED 1
duration of full database APPROVED 2
number of speakers APPROVED 1
recording environment APPROVED 2
Speaker Demographics CANDIDATE 1
quality APPROVED 2
recording platform hardware APPROVED 2
recording platform software APPROVED 2
sample rate APPROVED 1
Number of Channels CANDIDATE 1
byte order APPROVED 1
compression APPROVED 1
bit resolution APPROVED 1
sample coding CANDIDATE 1
mime type APPROVED 1
duration of effective production CANDIDATE 1
number of actors CANDIDATE 1
actor demographics CANDIDATE 1
annotation mode APPROVED 1
annotation format APPROVED 1
annotation tool CANDIDATE 1
tool type CANDIDATE 1