1、Types of corpora General vs. specialized corpora Written vs. spoken corpora Synchronic vs. diachronic corpora Monolingual vs. multilingual corpora Comparable vs. parallel corpora Native vs. learner corpora Sample vs. monitor corpora Raw vs. annotated corpora General vs. specialized corpora General c
2、orpora (通用语料库通用语料库) or reference corpora(参考语料库参考语料库): a wide coverage of different text categories or registers; represents language for general purposes. usu.: very large , millions of words. E.g. British National Corpus (BNC), Bank of English (BOE). specialized corpora (专用语料库专用语料库): texts from a p
3、articular variety of a language, e.g. from a particular dialect or from a particular subject area.Written vs. spoken corpora Written corpora(笔语语料库)(笔语语料库):contain only written materials. (more) Spoken corpora(口语语料库)(口语语料库):contain transcribed texts of spoken language. (less)Synchronic vs. diachronic
4、 corpora Synchronic corpora(共时语料库)(共时语料库): materials from a specific period of time. Diachronic corpora(历时语料库):(历时语料库):materials over a longer period of time.Monolingual vs. multilingual corpora Monolingual corpora(单语语料库):(单语语料库):texts in one language. Multilingual corpora(多语语料库):(多语语料库):texts in se
5、veral different languages.Comparable vs. parallel corpora Comparable corpora(可比语料库):可比语料库):texts from two or more languages which are similar in genre, topic, register etc. without, however, containing the same content. Parallel corpora(平行语料库)(平行语料库)(translation corpora)(翻译语料库):(翻译语料库):a corpus of o
6、riginal texts in one language and their translations into another (or several other languages)。探索“同一内容是如何用两种语言表达的” 。Native vs. learner corpora Native speakers corpora(本族语语料库)(本族语语料库):texts from native speakers. Learner corpora(学习者语料库)(学习者语料库):texts from language learners.Sample vs. monitor corpora S
7、ample corpora (样本语料库样本语料库):as opposed to a monitor corpus, a sample corpus is of finite size and consists of text segments selected to provide a static picture of language Monitor corpora (监控语料库监控语料库):monitor language change.It is regularly updated and open-ended.Raw vs. annotated corpora Raw corpor
以上就是本篇文章【语料库语言学:语料库的种类types of corpora】的全部内容了,欢迎阅览 ! 文章地址:http://sjzytwl.xhstdz.com/quote/64382.html 行业 资讯 企业新闻 行情 企业黄页 同类资讯 网站地图 返回首页 物流园资讯移动站 http://mip.xhstdz.com/ , 查看更多