Computational Linguistics and Intelligent Text Processing (Part 2)


Computational Linguistics and Intelligent Text Processing (Part 2) .. CICLing 2011


Machine Translation and Multilingualism

Leonardo Lesmo, Alessandro Mazzei, Daniele P. Radicioni:
Ontology Based Interlingua Translation. 1-12

Rashmi Gangadharaiah, Ralf D. Brown, Jaime G. Carbonell:
Phrasal Equivalence Classes for Generalized Corpus-Based Machine Translation. 13-28

Raphaël Rubino, Georges Linares:
A Multi-view Approach for Term Translation Spotting. 29-40

Waleed Ammar, Kareem Darwish, Ali El Kahki, Khaled Hafez:
ICE-TEA: In-Context Expansion and Translation of English Abbreviations. 41-54

Michael Paul, Andrew M. Finch, Eiichiro Sumita:
Word Segmentation for Dialect Translation. 55-67

Mohammad Taher Pilevar, Heshaam Faili, Abdol Hamid Pilevar:
TEP: Tehran English-Persian Parallel Corpus. 68-79

Daniel Andrade, Takuya Matsuzaki, Jun-ichi Tsujii:
Effective Use of Dependency Structure for Bilingual Lexicon Creation. 80-92

Pascual Martínez-Gómez, Germán Sanchis-Trilles, Francisco Casacuberta:
Online Learning via Dynamic Reranking for Computer Assisted Translation. 93-105

Information Extraction and Information Retrieval

Hans Uszkoreit:
Learning Relation Extraction Grammars with Minimal Human Intervention: Strategy, Results, Insights and Plans. 106-126

Haibo Li, Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuka:
Using Graph Based Method to Improve Bootstrapping Relation Extraction. 127-138

Asma Ben Abacha, Pierre Zweigenbaum:
A Hybrid Approach for the Extraction of Semantic Relations from MEDLINE Abstracts. 139-150

Jon Patrick, Mojtaba Sabbagh:
An Active Learning Process for Extraction and Standardisation of Medical Measurements by a Trainable FSA. 151-162

Dongwoo Kim, Alice H. Oh:
Topic Chains for Understanding a News Corpus. 163-176

Livio Robaldo, Tommaso Caselli, Irene Russo, Matteo Grella:
From Italian Text to TimeML Document via Dependency Parsing. 177-187

Shoji Fujiwara, Satoshi Sekine:
Self-adjusting Bootstrapping. 188-201

Letian Wang, Fang Li:
Story Link Detection Based on Event Words. 202-211

G. S. K. Santosh, N. Kiran Kumar, Vasudeva Varma:
Ranking Multilingual Documents Using Minimal Language Dependent Resources. 212-220

Yunqing Xia, Taotao Zhao, Jianmin Yao, Peng Jin:
Measuring Chinese-English Cross-Lingual Word Similarity with HowNet and Parallel Corpus. 221-233

Text Categorization and Classification

Hercules Dalianis, Jonas Sjöbergh, Eriks Sneiders:
Comparing Manual Text Patterns and Machine Learning for Classification of E-Mails for Automatic Answering by a Government Agency. 234-243

Nooshin Maghsoodi, Mohammad Mehdi Homayounpour:
Using Thesaurus to Improve Multiclass Text Classification. 244-253

Dat Huynh, Dat Tran, Wanli Ma, Dharmendra Sharma:
Adaptable Term Weighting Framework for Text Classification. 254-265

Iria da Cunha, M. Teresa Cabré, Eric SanJuan, Gerardo Sierra, Juan Manuel Torres Moreno, Jorge Vivaldi:
Automatic Specialized vs. Non-specialized Sentence Differentiation. 266-276

B. Thomas Adler, Luca de Alfaro, Santiago Moisés Mola-Velasco, Paolo Rosso, Andrew G. West:
Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features. 277-288

Su Yan, Dongwon Lee, Alex Hai Wang:
Costco: Robust Content and Structure Constrained Clustering of Networked Documents. 289-300

Summarization and Recognizing Textual Entailment

Horacio Saggion:
Learning Predicate Insertion Rules for Document Abstracting. 301-312

Jun Hatori, Akiko Murakami, Jun-ichi Tsujii:
Multi-topical Discussion Summarization Using Structured Lexical Chains and Cue Words. 313-327

Nik Adilah Hanin Binti Zahri, Fumiyo Fukumoto:
Multi-document Summarization Using Link Analysis Based on Rhetorical Relations between Sentences. 328-338

Yunqing Xia, Yonggang Zhang, Jianmin Yao:
Co-clustering Sentences and Terms for Multi-document Summarization. 339-352

Partha Pakray, Alexander F. Gelbukh, Sivaji Bandyopadhyay:
Answer Validation Using Textual Entailment. 353-364

Authoring Aid, Error Correction, and Style Analysis

Anabela Barreiro:
SPIDER: A System for Paraphrasing in Document Editing and Revision – Applicability in Machine Translation Pre-editing. 365-376

Ching-man Au Yeung, Kevin Duh, Masaaki Nagata:
Providing Cross-Lingual Editing Assistance to Wikipedia Editors. 377-389

Fabrizio Gotti, Philippe Langlais, Guy Lapalme, Simon Charest, Eric Brunelle:
Reducing Overdetections in a French Symbolic Grammar Checker by Classification. 390-401

Theologos Athanaselis, Stelios Bakamidis, Ioannis Dologlou:
Performance Evaluation of a Novel Technique for Word Order Errors Correction Applied to Non Native English Speakers’ Corpus. 402-410

Xiaohua Liu, Bo Han, Ming Zhou:
Correcting Verb Selection Errors for ESL with the Perceptron. 411-423

Philip van Oosten, Véronique Hoste, Dries Tanghe:
A Posteriori Agreement as a Quality Measure for Readability Prediction Systems. 424-435

Keiji Yasuda, Andrew M. Finch, Eiichiro Sumita:
A Method to Measure the Reading Difficulty of Japanese Words. 436-445

Shibamouli Lahiri, Prasenjit Mitra, Xiaofei Lu:
Informality Judgment at Sentence Level and Experiments with Formality Score. 446-457

Speech Recognition and Generation

M. Alejandro Reyes-Barragán, Manuel Montes-y-Gómez, Luis Villaseñor Pineda:
Combining Word and Phonetic-Code Representations for Spoken Document Retrieval. 458-466

Zeeshan Ahmed, Julie Carson-Berndsen:
Automatic Rule Extraction for Modeling Pronunciation Variation. 467-476

Jun Hatori, Hisami Suzuki:
Predicting Word Pronunciation in Japanese. 477-492

Chonlasith Jucksriporn, Ohm Sornil:
A Minimum Cluster-Based Trigram Statistical Model for Thai Syllabification. 493-505

Panagiota Karanasou, Lori Lamel:
Automatic Generation of a Pronunciation Dictionary with Rich Variation Coverage Using SMT Methods. 506-517