Abstract
Based on the modern Chinese semantics, a Chinese sentential semantic mode is built, and then a Chinese tagged corpus, BFS-CTC (Beijing forest studio-Chinese tagged corpus), is built according to the Chinese sentential semantic mode. There are more than ten thousand sentences in the corpus, and the corpus contains six kinds of Chinese syntactic types. Tagging the sentence quickly and conveniently could be implemented by using the self-developed tools. BFS-CTC provides lexical, syntactic and sentential semantic structure tagging information, so that it could be used in comparative analysis of syntactic and semantic, or used for horizontal analysis. In addition, the corpus has good scalability, and it could generate more targeted extension tagged banks.
Original language | English |
---|---|
Pages (from-to) | 311-315 |
Number of pages | 5 |
Journal | Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology |
Volume | 32 |
Issue number | 3 |
Publication status | Published - Mar 2012 |
Keywords
- Chinese information processing
- Corpus
- Semantic labeling
- Sentential semantic analysis
- Sentential semantic structure