Essential Speech and Language Technology for Dutch [electronic resource] : Results by the STEVIN-programme / edited by Peter Spyns, Jan Odijk.

Contributor(s): Material type: TextTextSeries: Theory and Applications of Natural Language ProcessingPublisher: Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2013Description: XVII, 413 p. 79 illus., 29 illus. in color. online resourceContent type:
  • text
Media type:
  • computer
Carrier type:
  • online resource
ISBN:
  • 9783642309106
Subject(s): Additional physical formats: Printed edition:: No titleDDC classification:
  • 410.285 23
LOC classification:
  • P98-98.5
Online resources:
Contents:
Foreword. by Linde van den Bosch -- Introduction -- Peter Spyns -- Part I. How it started -- 2.The STEVIN Programme: Result of Five Years Cross-Border HLT for Dutch Policy Preparation. P.Spyns and E.D’Halleweyn -- Part II. HLT Resource-project Related Papers -- 3.The JASMIN Speech Corpus: Recordings of Children, Non-Natives and Elderly People. C. Cucchiarini and H. Van Hamme -- 4.Resources Developed in the Autonomata Projects. H.van den Heuvel, J-P.Martens, G.Bloothooft, M.Schraagen, N.Konings, K.D’hanens, and Q.Yang -- 5.STEVIN can Praat. D.Weenink -- 6.SPRAAK: Speech Processing, Recognition and Automatic Annotation Kit. P.Wambacq, K.Demuynck, and D.Van Compernolle -- 7.COREA: Coreference Resolution for Extracting Answers for Dutch. I.Hendrickx, G.Bouma, W.Daelemans and V.Hoste -- 8.Automatic Tree Matching for Analysing Semantic Similarity in Comparable Text. E.Marsi and E.Krahmer -- 9.Large Scale Syntactic Annotation of Written Dutch: Lassy. G.van Noord, G.Bouma, F.van Eynde, D.de Kok, J.van der Linde, I.Schuurman, E.Tjong Kim Sang, and V.Vandeghinste -- 10.Cornetto: a Combinatorial Lexical Semantic Database for Dutch. P.Vossen, I.Maks, R.Segers, H.van der Vliet, M-F.Moens, K.Hofmann, E.Tjong Kim Sang, and M.de Rijke -- 11.Dutch Parallel Corpus: a Balanced Parallel Corpus for Dutch-English and Dutch-French. H.Paulusen, L.Macken, W.Vandeweghe, and P.Desmet -- 12.Identification and Lexical Representation of Multiword Expressions. J.Odijk -- 13.The Construction of a 500-million-word Reference Corpus of Contemporary Written Dutch. N.Oostdijk, M.Reynaert, V.Hoste, and I.Schuurman -- Part III. HLT Technology Related Papers -- 14.Lexical Modeling for Proper Name Recognition in Autonomata Too -- B.Réveil, J-P.Martens, H.van den Heuvel, G.Bloothooft, and M.Schraagen -- 15.N-Best 2008: a Benchmark Evaluation for Large Vocabulary Speech Recognition in Dutch. D.A. van Leeuwen -- 16.Missing Data Solutions for Robust Speech Recognition. Y.Wang, J.F.Gemmeke, K.Demuynck, and H.Van Hamme -- 17.Parse and Corpus-based Machine Translation. V.Vandeghinste, S.Martens, G.Kotzé, J.Tiedemann, J.Van den Bogaert, K.De Smet, F.Van Eynde, and G.van Noord -- Part IV.HLT Application Related Papers -- 18.Development and Integration of Speech technology into COurseware for Language Learning: the DISCO Project. H. Strik, J. van Doremalen, J. Colpaert, and C. Cucchiarini -- 19.Question Answering of Informative Web Pages: How Summarisation Technology Helps. J.De Belder, D.de Kok, G.van Noord, F.Nauze, L.van der Beek, and M-F.Moens -- 20.Generating, Refining and Using Sentiment Lexicons. M.de Rijke, V.Jijkoun, F.Laan, W.Weerkamp, P.Ackermans, and G.Geleijnse -- Part V. And now -- 21.The Dutch-Flemish HLT Agency: Managing the Lifecycle of STEVIN’s Language Resources. R.van Veenendaal, L.van Eerten, C.Cucchiarini, and P.Spyns -- 22.Conclusions and Outlook to the Future. Jan Odijk.
In: Springer eBooksSummary: The book provides an overview of more than a decade of joint R&D efforts in the Low Countries on HLT for Dutch. It not only presents the state of the art of HLT for Dutch in the areas covered, but, even more importantly, a description of the resources (data and tools) for Dutch that have been created  are now  available for both academia and industry worldwide. The contributions cover many areas of human language technology (for Dutch): corpus collection (including IPR issues) and building (in particular one corpus aiming at a collection of 500M word tokens), lexicology, anaphora resolution, a semantic network, parsing technology, speech recognition, machine translation, text (summaries) generation, web mining, information extraction, and text to speech to name the most important ones. The book also shows how a medium-sized language community (spanning two territories) can create a digital language infrastructure (resources, tools, etc.) as a basis for subsequent R&D. At the same time, it bundles contributions of almost all the HLT research groups in Flanders and the Netherlands, hence offers a view of their recent research activities. Targeted readers are mainly researchers in human language technology, in particular those focusing on Dutch. It concerns researchers active in larger networks such as the CLARIN, META-NET, FLaReNet and participating in conferences such as ACL, EACL, NAACL, COLING, RANLP, CICling, LREC, CLIN and DIR ( both in the Low Countries), InterSpeech, ASRU, ICASSP, ISCA, EUSIPCO, CLEF, TREC, etc. In addition, some chapters are interesting for human language technology  policy makers and even for science policy makers in general.
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
No physical items for this record

Foreword. by Linde van den Bosch -- Introduction -- Peter Spyns -- Part I. How it started -- 2.The STEVIN Programme: Result of Five Years Cross-Border HLT for Dutch Policy Preparation. P.Spyns and E.D’Halleweyn -- Part II. HLT Resource-project Related Papers -- 3.The JASMIN Speech Corpus: Recordings of Children, Non-Natives and Elderly People. C. Cucchiarini and H. Van Hamme -- 4.Resources Developed in the Autonomata Projects. H.van den Heuvel, J-P.Martens, G.Bloothooft, M.Schraagen, N.Konings, K.D’hanens, and Q.Yang -- 5.STEVIN can Praat. D.Weenink -- 6.SPRAAK: Speech Processing, Recognition and Automatic Annotation Kit. P.Wambacq, K.Demuynck, and D.Van Compernolle -- 7.COREA: Coreference Resolution for Extracting Answers for Dutch. I.Hendrickx, G.Bouma, W.Daelemans and V.Hoste -- 8.Automatic Tree Matching for Analysing Semantic Similarity in Comparable Text. E.Marsi and E.Krahmer -- 9.Large Scale Syntactic Annotation of Written Dutch: Lassy. G.van Noord, G.Bouma, F.van Eynde, D.de Kok, J.van der Linde, I.Schuurman, E.Tjong Kim Sang, and V.Vandeghinste -- 10.Cornetto: a Combinatorial Lexical Semantic Database for Dutch. P.Vossen, I.Maks, R.Segers, H.van der Vliet, M-F.Moens, K.Hofmann, E.Tjong Kim Sang, and M.de Rijke -- 11.Dutch Parallel Corpus: a Balanced Parallel Corpus for Dutch-English and Dutch-French. H.Paulusen, L.Macken, W.Vandeweghe, and P.Desmet -- 12.Identification and Lexical Representation of Multiword Expressions. J.Odijk -- 13.The Construction of a 500-million-word Reference Corpus of Contemporary Written Dutch. N.Oostdijk, M.Reynaert, V.Hoste, and I.Schuurman -- Part III. HLT Technology Related Papers -- 14.Lexical Modeling for Proper Name Recognition in Autonomata Too -- B.Réveil, J-P.Martens, H.van den Heuvel, G.Bloothooft, and M.Schraagen -- 15.N-Best 2008: a Benchmark Evaluation for Large Vocabulary Speech Recognition in Dutch. D.A. van Leeuwen -- 16.Missing Data Solutions for Robust Speech Recognition. Y.Wang, J.F.Gemmeke, K.Demuynck, and H.Van Hamme -- 17.Parse and Corpus-based Machine Translation. V.Vandeghinste, S.Martens, G.Kotzé, J.Tiedemann, J.Van den Bogaert, K.De Smet, F.Van Eynde, and G.van Noord -- Part IV.HLT Application Related Papers -- 18.Development and Integration of Speech technology into COurseware for Language Learning: the DISCO Project. H. Strik, J. van Doremalen, J. Colpaert, and C. Cucchiarini -- 19.Question Answering of Informative Web Pages: How Summarisation Technology Helps. J.De Belder, D.de Kok, G.van Noord, F.Nauze, L.van der Beek, and M-F.Moens -- 20.Generating, Refining and Using Sentiment Lexicons. M.de Rijke, V.Jijkoun, F.Laan, W.Weerkamp, P.Ackermans, and G.Geleijnse -- Part V. And now -- 21.The Dutch-Flemish HLT Agency: Managing the Lifecycle of STEVIN’s Language Resources. R.van Veenendaal, L.van Eerten, C.Cucchiarini, and P.Spyns -- 22.Conclusions and Outlook to the Future. Jan Odijk.

Open Access

The book provides an overview of more than a decade of joint R&D efforts in the Low Countries on HLT for Dutch. It not only presents the state of the art of HLT for Dutch in the areas covered, but, even more importantly, a description of the resources (data and tools) for Dutch that have been created  are now  available for both academia and industry worldwide. The contributions cover many areas of human language technology (for Dutch): corpus collection (including IPR issues) and building (in particular one corpus aiming at a collection of 500M word tokens), lexicology, anaphora resolution, a semantic network, parsing technology, speech recognition, machine translation, text (summaries) generation, web mining, information extraction, and text to speech to name the most important ones. The book also shows how a medium-sized language community (spanning two territories) can create a digital language infrastructure (resources, tools, etc.) as a basis for subsequent R&D. At the same time, it bundles contributions of almost all the HLT research groups in Flanders and the Netherlands, hence offers a view of their recent research activities. Targeted readers are mainly researchers in human language technology, in particular those focusing on Dutch. It concerns researchers active in larger networks such as the CLARIN, META-NET, FLaReNet and participating in conferences such as ACL, EACL, NAACL, COLING, RANLP, CICling, LREC, CLIN and DIR ( both in the Low Countries), InterSpeech, ASRU, ICASSP, ISCA, EUSIPCO, CLEF, TREC, etc. In addition, some chapters are interesting for human language technology  policy makers and even for science policy makers in general.

There are no comments on this title.

to post a comment.