CALL(8):Corpus Linguistics 语料库语言学

来源:百度文库 编辑:神马文学网 时间:2024/07/03 13:48:44
Corpus Linguistics 语料库语言学
2007-04-27 11:22:07
部分重要概念
Text Corpus In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (now usually electronically stored and processed). They are used to do statistical analysis, checking occurrences or validating linguistic rules on a specific universe.
Brown Corpus The Brown Corpus of Standard American English (or just Brown Corpus) was compiled by Henry Kucera and W. Nelson Francis at Brown University, Providence, RI as a general corpus (text collection) in the field of corpus linguistics.
Bank of English The Bank of English is the name of the COBUILD corpus, a collection of English texts. These are mainly British, but American and Australian data are also included.
Part-of-Speech Tagging
Part-of-speech tagging (POS tagging or POST), also called grammatical tagging, is the process of marking up the words in a text as corresponding to a particular part of speech, based on both its definition, as well as its context, i.e. relationship with adjacent and related words in a phrase, sentence, or paragraph.
重要参考文献
何安平,2004,《语料库语言学与英语教学》,北京:外语教学与研究出版社。
杨惠中(编),2002,《语料库语言学导论》,上海:上海外与教育出版社。Gavioli, L. (2005). Exploring corpora for ESP learning. Amsterdam: John Benjamins.
华南师范大学外国语言文化学院编委会(编),2005,《语料库语言学的研究与应用》,长春:东北师范大学出版社。
Kennedy, G. (2000). An introduction to corpus linguistics [语料库语言学入门], 北京:外语教学与研究出版社。
Deignan, A. (2005). Metaphor and corpus linguistics. Amsterdam: John Benjamins.
Dash, N. S. (2005). Corpus linguistics and language technology: With reference to Indian language. New Delhi: Mittal Publications.
Connor, U. & Upton, T. A. (2004). (Eds.) Applied corpus linguistics: A multidimensional perspective. New York: Rodopi.
Halliday, M.A.K. et al. (2004). Lexicography and corpus linguistics: An introduction. New York: Continuum.
领域前沿
Mark Davies, Brigham Young University
 http://davies-linguistics.byu.edu/personal/
Susan Hunston, University of Birmingham
 http://www.english.bham.ac.uk/who/hunston.htm
Gary Kennedy, Ohio State University
 http://www.math.ohio-state.edu/~kennedy/
Wolfgang Teubert, University of Birmingham
 http://www.english.bham.ac.uk/who/teubert.htm
Corpus Linguistics 2007, the fourth Corpus Linguistics conference, the University of Birmingham
 http://www.corpus.bham.ac.uk/conference2007/
International Journal of Corpus Linguistics
 http://www.benjamins.com/cgi-bin/t_seriesview.cgi?series=IJCL
The Inter-Varietal Applied Corpus Studies (IVACS)
 http://www.mic.ul.ie/ivacs/about.htm
British National Corpus
 http://www.comp.lancs.ac.uk/computing/research/ucrel/bnc.html