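LOB Corpus
Created: early 1970s
Created by: Lancaster University (UK) together with the University of Oslo and the University of Bergen (Norway)
Size: 1 million word tokens
Overview: Built to study contemporary British English and to support comparison with American English. The corpus was tagged with the TAGIT system, and a transition probability matrix was estimated statistically to raise tagging accuracy.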
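As a rough illustration of what a statistically estimated transition probability matrix looks like, the sketch below counts tag bigrams in a few hand-tagged sentences and normalises each row into conditional probabilities. The function name `transition_matrix`, the toy sentences, and the tag labels are all assumptions made for this example; they are not the LOB tagset and not the procedure the corpus builders actually used.

```python
from collections import Counter, defaultdict


def transition_matrix(tagged_sentences):
    """Estimate P(next_tag | tag) from tag-bigram counts.

    tagged_sentences: list of sentences, each a list of (word, tag) pairs.
    Returns a nested dict: matrix[tag][next_tag] -> probability.
    """
    counts = defaultdict(Counter)
    for sentence in tagged_sentences:
        tags = [tag for _, tag in sentence]
        # Count each adjacent pair of tags within the sentence.
        for prev_tag, next_tag in zip(tags, tags[1:]):
            counts[prev_tag][next_tag] += 1

    matrix = {}
    for prev_tag, following in counts.items():
        total = sum(following.values())
        # Normalise the row so its probabilities sum to 1.
        matrix[prev_tag] = {t: n / total for t, n in following.items()}
    return matrix


# Toy hand-tagged data (hypothetical, not taken from LOB).
toy_sentences = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("the", "DET"), ("old", "ADJ"), ("dog", "NOUN"), ("sleeps", "VERB")],
]

matrix = transition_matrix(toy_sentences)
print(matrix["DET"])   # {'NOUN': 0.5, 'ADJ': 0.5}
print(matrix["NOUN"])  # {'VERB': 1.0}
```

A tagger can use such a matrix to prefer the candidate tag that is most likely to follow the preceding tag when a word is ambiguous.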
The Lancaster-Oslo/Bergen Corpus (LOB) was compiled by researchers in Lancaster, Oslo and Bergen. It consists of one million words of British English texts from 1961. The texts for the corpus were sampled from 15 different text categories. Each text is just over 2,000 words long (longer texts have been cut at the first sentence boundary after 2,000 words), and the number of texts in each category varies (see table below). Further information about the texts can be found in the LOB manual. This corpus is the British counterpart of the Brown Corpus of American English, which contains texts printed in the same year, so that the two varieties can be compared.
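The cutting rule described above (keep whole sentences until the 2,000-word mark has been passed) can be sketched as follows. This is a minimal illustration, not the compilers' actual procedure: the function name `cut_sample` is hypothetical, sentences are detected with a naive regular expression, and words are counted by splitting on whitespace.

```python
import re


def cut_sample(text, limit=2000):
    """Truncate text at the first sentence boundary at or after `limit` words.

    Assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    kept = []
    word_count = 0
    for sentence in sentences:
        kept.append(sentence)
        word_count += len(sentence.split())
        if word_count >= limit:
            # The limit has been reached; stop at this sentence boundary.
            break
    return " ".join(kept)


# Small demonstration with a 4-word limit instead of 2,000.
sample = cut_sample("One two three. Four five six. Seven eight nine.", limit=4)
print(sample)  # One two three. Four five six.
```

Because the cut always falls on a sentence boundary, each sample ends up slightly longer than the nominal limit, which matches the "just over 2,000 words" description.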