Here's an information retrieval / computational linguistics paper by Hackett and Oard: "Comparison of Word-Based and Syllable-Based Retrieval for Tibetan".
Abstract: Tibetan retrieval based on automatically segmented words is compared with the use of overlapping syllable n-grams using a known-item retrieval evaluation. The optimal span of fixed-length n-grams is found to be 2 syllables, and indexing words is found to be as effective as indexing syllable bigrams.
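For anyone wondering what "overlapping syllable n-grams" might look like in practice, here is a rough Python sketch. It is not the authors' code. It assumes syllables can be found by splitting on the tsheg (U+0F0B), treats the shad (U+0F0D) and whitespace as extra boundaries, and lets n-grams run across those boundaries for simplicity. The function names `split_syllables` and `syllable_ngrams` are made up for illustration.

```python
TSHEG = "\u0f0b"  # Tibetan intersyllabic mark (tsheg)
SHAD = "\u0f0d"   # Tibetan clause/sentence mark (shad)


def split_syllables(text):
    """Split Tibetan text into syllables.

    Assumption: tsheg, shad, and whitespace all count as syllable
    boundaries. Real text has more punctuation than this handles.
    """
    text = text.replace(SHAD, TSHEG)
    syllables = []
    for chunk in text.split():  # split on any whitespace first
        syllables.extend(s for s in chunk.split(TSHEG) if s)
    return syllables


def syllable_ngrams(syllables, n=2):
    """Return overlapping n-grams of syllables, joined by tsheg.

    With n=2 these are the syllable bigrams mentioned in the abstract.
    Sequences shorter than n yield no n-grams.
    """
    return [
        TSHEG.join(syllables[i:i + n])
        for i in range(len(syllables) - n + 1)
    ]


if __name__ == "__main__":
    sample = "བོད་ཀྱི་སྐད་ཡིག"  # four syllables: bod kyi skad yig
    for gram in syllable_ngrams(split_syllables(sample), n=2):
        print(gram)
```

Each bigram would then be indexed as a single term, so a query is matched on pairs of adjacent syllables rather than on segmented words.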