Sunday, August 3, 2008

Some Trends of Asian Language Technology

  • segmentation and tokenization: segment into unique word tokens   
  • Lemmatization: dictionary base form for an inflected verb or adjective 
  • Noun Decompounding: separate compound nouns
  • POS tagging
  • Sentence boundary detection
  • Base NP analysis: identify sets of words including a noun which describes a single nominal expression 

No comments:

About Me

I am the normal young boy with the normal daily life but looking for fantastic nights which brighten my life motivation.