Computational Linguistics & Phonetics Computational Linguistics & Phonetics Fachrichtung 4.7 Universität des Saarlandes
Language Technology 1, Sample Question?

List of Sample Question



Text Classification
  1. Name 5 application areas of text classification.
  2. Describe how a rule based classification approach works. What are the advantages and disadvantages of this approach?
  3. Describe possible linguistic preprocessing steps of statistical classification approaches.
  4. Describe the Naive Bayes classification approach. Why is it called "naive"?
  5. Describe how documents can be represented as vectors.
  6. Name and describe three different weighting schemata for words in document vectors.
  7. Name and shortly describe (without formulas) three methods for feature selection.
  8. Describe the Roccio classification approach. What are its advantages and disadvatages?
  9. Describe the k-nearest neighbors classification approach. What are its advantages and disadvatages?
  10. Describe the idea of the support vector machine classification approach? What are its advantages and disadvantages?
  11. Describe the possible results of a binary classification. How can these be used to define evaluation measures?
  12. Why are precision and recall misleading when examined alone? What are the alternatives?
  13. What is the best classification approach? Why?
  14. What is a character-level n-gram and what are its advantages over term n-grams in text classification?



Shallow Processing & NEE
    Download the Summary File