Character N-gram Tokenization; Cross-Language Information Retrieval; Information Retrieval; Parallel Corpora; Text Processing; Text Retrieval; Computer Science (0984)
Cluster Sampling; Finite-Mixture and Dirichlet-Multinomial Distributions; Generalized Estimating Equations; Marginal and Conditional Models for Overdispersion; Overdispersion; Random Effects; Statistics (0463)
Needs Analysis; English for Science and Technology; English for Specific Purposes; Language, Linguistics (0290); Education, Curriculum and Instruction (0727); Education, Higher (0745)