ORTOLANG Deposit and sharing

Speech and Language Data Repository (SLDR/ORTOLANG)

Investissements d'avenir  Huma-Num  CLARIN

Open archives (OAI-PMH)

Lost in migration ? This document may help you : infoMigration-en_0.2.pdf

New deposits and registration of new users must now be done on Ortolang platform.

As of 01/12/2015, deposit of data on SLDR website will be suspended to allow the public opening of Ortolang platform.

Detailed query


The latest deposits (341) >> more<<  page 3  >>
Secondary data (resource) sldr000720
Transcriptions of CID (Roxane BERTRAND)
2014-05-06 - version 1 - source data
Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)

8 French dialogues involving 2 participants, with the following data:
• a wav file for each speaker
• the inter-pausals units (IPU) annotation aligned with audio signal
• enriched orthographic transcription (TOE) aligned with audio signal
• phones aligned with audio signal
• syllables aligned with audio signal
• tokens aligned with audio signal
• [...]


(phonology, phonetics, speech_prosody, general_linguistics, computational_linguistics)
French

(see members)
picto

    • ANR BLAN08-2_349062

    hdl:11041/sldr000720
    isRequiredBy primary data (corpus) sldr000027 Videos of CID
    project http://lpl-aix.fr/projet/114
    project http://lpl-aix.fr/projet/170
    project http://lpl-aix.fr/projet/71
    (isReferencedBy secondary data (resource) sldr000803 ORCHID.fr)
    (requires secondary data (resource) sldr000873 Transcription Orthographique Enrichie)
    (isPartOf collection ortolang-000901 VariAMU)
    (isRequiredBy secondary data (resource) ortolang-000722 Annotations of CID)
    (isPartOf collection ortolang-000911 CoFee - CoFee)
    (requires secondary data (resource) ortolang-000918 CID-DISP)
    description http://portal.acm.org/citation.cfm?id=1868749...
    isReferencedBy http://www.lrec-conf.org/proceedings/lrec2010...
    isReferencedBy http://sldr.org/publi/104/en
    Primary data (corpus) sldr000786
    MARC-Fr (Brigitte Bigi, Pauline Péri)
    2014-05-05 - version 1 - source data
    Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)

    Corpus français manuellement phonétisé et aligné d’une durée de 7 minutes. Composé de 3 sous-corpus : CID, AixOx et Grenelle.


    (phonology, phonetics, speech_prosody, general_linguistics, computational_linguistics, text_and_corpus_linguistics)
    French (français)

    >> Collection Multimodalité et débats à l'Assemblée nationale - Multimodality and debates in the National Assembly sldr000729
    (see members)
    hdl:11041/sldr000786
    This material is Open Data
    isPartOf primary data (corpus) sldr000027 Videos of CID
    isPartOf collection sldr000729 Multimodalité et débats à l'Assemblée nationale - Multimodality and debates in the National Assembly
    isPartOf primary data (corpus) sldr000784 AixOx
    (isPartOf collection sldr000725 OMProDat)
    Primary data (corpus) sldr000876
    Oursel/Corpus Cours alpha (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)

    Private course of reading given by a non professional to a algerian woman, mother of children, and auto-confrontation of the teacher to her teaching practice.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000876
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000877
    Oursel/Accompagnement vers l’emploi (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Meeting between a non native French speaker with a non professional social worker who helps find work.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000877
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000878
    Oursel/Assistante sociale (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Meeting between people from a social center and a social worker. The people speak French as a second language. The social worker is French.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000878
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000879
    Oursel/Ateliers socio-linguistiques (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Practical courses in French as a foreign language. A dozen participants, non native French speakers and a non professional teacher.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000879
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000880
    Oursel/Ecole doctorale (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Meeting between an italian PhD student and a romanian doctoral school secretary. Administrative problem.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000880
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000881
    Oursel/Ecrivain public (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Meetings between users and a public writer. Non native French speaking users, native speaking French public writer. Miscelaneous administrative problems.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000881
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000882
    Oursel/Entretiens compréhension orale (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Interviews between the researcher and foreign students about their learning of French, their sojourn in France and the evolution of their ease in understanding French speakers.
    Some recordings also contain a test of recognition of implicits.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000882
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000883
    Oursel/Entretiens conversation (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Conversations between the researcher and foreign students. Various themes.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000883
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000884
    Oursel/Institut de traducteurs et d’interprètes (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Meetings between students or futur students in translation or interpretation and the director of the programme. Director native English speaker, students native French speakers.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000884
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000885
    Oursel/Office de l’immigration - Accueil (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Meetings between users and administrative agents at the office of immigration. Administrative questions about the procedure to obtain the long-stay (1 year) visa.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000885
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000886
    Oursel/Office de l’immigration - Audit (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Evaluation of the needs for training of the immigrants who obtain a long-stay visa.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000886
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000887
    Oursel/Préfecture (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Administrative conversations between agents of the prefecture (bureau of visas) and foreign users.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000887
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000888
    Oursel/Secrétariat de FLE (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Rencontre entre une étudiante de français langue étrangère et un secrétaire de département pour récupérer son diplôme.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000888
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000889
    Oursel/Service des relations internationales (Elodie OURSEL)
    2014-05-04 - version 1 - source data
    SYstèmes Linguistiques, Enonciation et Discours - EA2290 (SYLED, Paris FR)
    Analyse et traitement informatique de la langue française - UMR 7118 (ATILF, Nancy FR)

    Meetings between foreign students at the end of their sojourn in France and the secretary of the service of international relations in a university. Administrative questions.


    (sociolinguistics, language_acquisition)
    French (français)

    (see members)

    • IRCOM
    • ORTOLANG

    hdl:11041/sldr000889
    (isPartOf collection ortolang-000898 Interactions FLE (Oursel))
    Primary data (corpus) sldr000023
    RARHYF (Antoine Giovanni, Centre Hospitalier Universitaire de la Timone)
    2014-04-08 - version 1 - source data
    Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)

    Repeated Acoustic Records in Healthy Young Females


    (phonetics, applied_linguistics, speech_prosody)
    French (français)

    (see members)
    hdl:11041/sldr000023
    Secondary data (resource) sldr000874
    Articulatory Phonology Lexicon (Alain Marchal, Laboratoire parole et langage - UMR 7309)
    2014-03-13 - version 1 - source data
    Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)

    Articulatory Phonology Lexicon (in French)


    (phonology, phonetics)

    (see members)
    hdl:11041/sldr000874
    This material is Open Data
    Secondary data (resource) sldr000873
    Transcription Orthographique Enrichie (Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR))
    2014-03-12 - version 1 - source data
    Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)

    Transcription Orthographique Enrichie


    (general_linguistics, phonetics, phonology, speech_prosody, computational_linguistics, applied_linguistics)

    (see members)
    hdl:11041/sldr000873
    project http://lpl-aix.fr/projet/170
    isRequiredBy secondary data (resource) sldr000720 Transcriptions of CID
    project http://lpl-aix.fr/projet/114
    Primary data (corpus) blri-000872
    Raw data for the CATSEM project (MEG) (Jean-michel Badier, INS, Catherine Liegois Chauvel, INS, Johannes Ziegler, LPC)
    2014-03-07 - version 1 - source data
    Laboratoire de Psychologie Cognitive - UMR7290 (LPC, Marseille FR)
    Institut de Neurosciences des Systèmes (INS, Marseille FR)

    Study of the intellectual correlates of oral and written language in magnetoencephalography.

    The objective is to study the intellectual mechanisms which underlie the perception(collection) of word and the reading by means of the technique of magnetoencephalography (MEG). The purpose is to specify the role and the properties of activation of the various intellectual structures implied (involved) [...]


    (cognitive_science, neurolinguistics)
    French (français)

    >> Collection BLRI blri-000835
    (see members)
    picto

    • Brain and Language Research Institute

    hdl:11041/blri-000872
    isPartOf collection blri-000835 BLRI
    Tool sldr000778
    ProZed. A prosody editor for linguists (Daniel HIRST)
    2014-02-24 - version 1 - source data
    Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)

    ProZed is a tool designed to allow linguists to manipulate the prosody of an utterance via a symbolic representation in order to evaluate linguistic models.
    Prosody is manipulated via a Praat TextGrid which allows the user to modify the rhythm and melody.
    Rhythm is manipulated by factoring segmental duration into three components: (i) intrinsic duration determined by phonemic identity (ii) [...]


    (see members)
    hdl:11041/sldr000778
    Secondary data (resource) sldr000871
    SEC POS annotations (Norwegian Computing Centre for the Humanities (NCCH, Bergen NO), Lancaster University (Lancaster UK))
    2014-02-03 - version 1 - source data
    Norwegian Computing Centre for the Humanities (NCCH, Bergen NO)
    Lancaster University (Lancaster UK)

    Aix-MARSEC annotations


    (speech_prosody, phonology, phonetics, computational_linguistics)

    (see members)
      hdl:11041/sldr000871
      isRequiredBy primary data (corpus) sldr000033 Aix-MARSEC database
      [ARK] Tool sldr000526
      Anonymise sound files (Daniel HIRST)
      2014-01-24 - version 5 - long-term preservation
      Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)

      PRAAT script.
      Purpose: replace portions of a long sound which are labelled with a key word on the accompanying TextGrid with a hum sound with the same prosodic characteristics as the original sound.
      Original long sound can be mono or stereo, anonymised sound will be same.
      TextGrid may be constructed from simple table using Table2textgrid.


      (applied_linguistics, cognitive_science, language_documentation, speech_prosody, computational_linguistics)

      >> Collection LPL tools lpl-000763
      (see members)
      hdl:11041/sldr000526
      isPartOf collection lpl-000763 LPL tools
      requires http://www.fon.hum.uva.nl/praat/
      requires tool sldr000811 Table2textgrid
      Primary data (corpus) sldr000870
      Corpus audio de teko (émérillon) (Françoise ROSE)
      2014-01-21 - version 1 - source data
      Dynamique du langage - UMR 5596 (DDL, Lyon FR)

      Collection of recordings of the Teko (Emerillon) language, a Tupi-Guarani language spoken in French Guiana. Recordings are linked to a transcription, French translation, morphological parsing with parts of speech information (data exported from Toolbox).


      (language_documentation, text_and_corpus_linguistics)
      Emerillon (teko)

      (see members)
      picto picto2
      hdl:11041/sldr000870
      This material is Open Data
      Primary data (corpus) sldr000869
      Français parlé dans le nord du Gabon (Magali ITALIA)
      2014-01-14 - version 1 - source data
      Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)

      Français parlé par des Gabonais du nord du pays, des locuteurs âgés peu ou pas scolarisés et de jeunes adultes moyennement scolarisés.


      (syntax, morphology, discourse_analysis)
      French (français)

      (see members)
      picto
      hdl:11041/sldr000869
      Primary data (corpus) sldr000856
      Espagnol L1 dans un contexte francophone - Spanish L1 in a French context (Adrià OLTRA)
      2014-01-10 - version 1 - source data
      Département de sciences du langage, Université d'Aix-Marseille (Aix-en-Provence FR)

      Observing Spanish spoken as L1 by a French born woman from Spanish family. The interest of this study is to evaluate the development of the mother tongue without sociolinguistic context. The corpus collects a random piece of spontaneous speech and two random pieces of a reading from a press text.


      (computational_linguistics, language_acquisition, sociolinguistics, general_linguistics)
      Spanish (Español)

      >> Collection Travaux d'étudiants Master LEX sldr000804
      (see members)

      hdl:11041/sldr000856
      isPartOf collection sldr000804 Travaux d'étudiants Master LEX
      Primary data (corpus) sldr000863
      Francique et français se mélangent - Francique et français se mélangent (Amélie SCHNEIDER)
      2014-01-09 - version 1 - source data
      Département de sciences du langage, Université d'Aix-Marseille (Aix-en-Provence FR)

      Conversation téléphonique d’une locutrice mosellane (50a, née à Bining, langue maternelle : dialecte, autres langues : français, allemand, anglais) avec sa sœur mosellane (49 a, née à Bining, langue maternelle : dialecte, autres langues : français, allemand) en parole spontanée, enregistré au format MP3 à l’aide de mon téléphone portable, puis converti au format [...]


      (computational_linguistics, sociolinguistics)
      Old High German -> Rhine Franconian (Rheinfränkisch)

      >> Collection Travaux d'étudiants Master LEX sldr000804
      (see members)
      picto

      hdl:11041/sldr000863
      isPartOf collection sldr000804 Travaux d'étudiants Master LEX
      Primary data (corpus) sldr000868
      JURISDICT (Adam Mickiewicz University Foundation (Poznan PL))
      2014-01-09 - version 1 - source data
      Adam Mickiewicz University Foundation (Poznan PL)

      The JURISDICT speech database is a large continuous speech database originally designed for dictated speech recognition.
      The database includes above 1500 annotated sessions of speakers from 16 regions of Poland, plus another 500 experimental recordings.
      The JURISDICT database is intended to provide material for both training and testing of speech dictation of common and legal texts, [...]


      (applied_linguistics, phonetics, phonology)
      Polish (język polski)

      (see members)Not (yet) available from this site.
      hdl:11041/sldr000868
      Primary data (corpus) sldr000867
      Code d’alternance dans un contact de langue chez un locuteur berbère (Smail TOUMERT)
      2014-01-08 - version 1 - source data
      Département de sciences du langage, Université d'Aix-Marseille (Aix-en-Provence FR)

      Cet enregistrement a été réalisé dans le cadre de parole spontanée durant une conversation téléphonique entre deux étudiants algériens- à savoir moi même entant que locuteur 1 et locuteur 2 dont on entend pas sa voix- qui vivent en France. Les langues parlées sont le français et une variété du berbère qui est le kabyle.


      (computational_linguistics, sociolinguistics)
      Kabyle (Taqbaylit)

      >> Collection Travaux d'étudiants Master LEX sldr000804
      (see members)
      picto

      hdl:11041/sldr000867
      isPartOf collection sldr000804 Travaux d'étudiants Master LEX
      Primary data (corpus) sldr000866
      Corpus-e/ɛ - Corpus-e/ɛ (Ouissam BAIDADA)
      2014-01-06 - version 1 - source data
      Département de sciences du langage, Université d'Aix-Marseille (Aix-en-Provence FR)

      Ce corpus se compose d’un enregistrement de quatre listes de mots contenant /e/ et /ɛ/ dans des positions différentes, produites par des locuteurs méridionaux. Le but de ce corpus est de montrer la difficulté de discrimination de contraste /e/-/ɛ/ en situation de production par des sujets méridionaux.


      (computational_linguistics, phonetics, text_and_corpus_linguistics, phonology)
      French (français)

      >> Collection Travaux d'étudiants Master LEX sldr000804
      (see members)
      picto

      hdl:11041/sldr000866
      isPartOf collection sldr000804 Travaux d'étudiants Master LEX

      This site has been declared to Commission Nationale de l’Informatique et des Libertés (CNIL) under agreement Nr.1222972 on 26 March 2008. As per French Law, any person cited by name is granted access to, modification, correction and suppression of data relative to him/her (art. 34 of the « Informatique et Libertés » act of 6 January 1978). To exert your right, send a message to webmaster(at)sldr.org.

      This site is optimized for FireFox or any browser with the 'tabs' option set.