:: MAIN CONFERENCE :: ChairsArea chairs Invited speakers Instructions for Authors List of accepted papers General Conference Chair: Donia Scott (University of Brighton, UK) Local Organization Chair: Toni Badia (Universitat Pompeu Fabra, Spain) Program co-chairs:
Walter Daelemans
Marilyn Walker
Elisabeth Andre, University of Augsburg, Germany The flexibility of human speech recognition and the seeds of language change ANNE CUTLER, Max Planck Institute for Psycholinguistics Phonetic categories of the mother tongue are learned early, and are notoriously well-anchored, in the sense that they can interfere with speech perception in second languages. However, phonetic categories of the mother tongue are not immutable. Category boundaries change over time within language communities and within individuals. A series of experiments (beginning with Norris, McQueen & Cutler, Cog. Psych., 2003) has explored the factors which determine how adult listeners retune phonetic contrasts. Dutch subjects made lexical decisions on words and nonwords, where a final phoneme in some words had been replaced by an ambiguous sound. This experience led them to shift their category boundary for a subsequent phonetic categorisation task. The boundary shift only occurred when lexical information was available to train the categorisation; exposure to the ambiguous sound in nonwords did not induce retuning. Explicit attention to the lexical information is, however, not required. Retuning can be initially speaker-specific and thus enables adjustment for individual speakers without affecting perception of other speakers. Finally, retuning generalises across words. These results suggest that feedback links in speech perception are used for learning but not for online processing, and they further argue against speech perception models with no abstract prelexical or lexical representations (such as radical episodic models).
If I Have a Hammer:
Jack Mostow, Research Professor
Project LISTEN’s Reading Tutor uses speech recognition to listen to children read aloud, and helps them learn to read, as evidenced by rigorous evaluations of pre- to posttest gains compared to various controls. In the 2003-2004 school year, children ages 5-14 used the Reading Tutor daily at school on over 200 computers, logging over 50,000 sessions, 1.5 million tutorial responses, and 10 million words.
· A production model predicts student behavior, such as likely oral reading mistakes. · A language model predicts likely word sequences for a given task, such as oral reading. · A student model estimates a student’s skills, such as mastery of grapheme-to-phoneme mappings. · A pedagogical model guides tutorial decisions, such as choosing words a student is ready to try. Acknowledgements
This work was supported in part by the National Science Foundation under ITR/IERI Grant No. REC-0326153. Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the author and do not necessarily reflect the views of the National Science Foundation or the official policies, either expressed or implied, of the sponsors or of the United States Government.
For camera-ready versions of papers, use the instructions in the following page: A DISTRIBUTIONAL MODEL OF SEMANTIC CONTEXT EFFECTS IN LEXICAL PROCESSING Scott Macdonald Chris Brew
A GEOMETRIC VIEW ON BILINGUAL LEXICON EXTRACTION FROM COMPARABLE
CORPORA
A JOINT SOURCE-CHANNEL MODEL FOR MACHINE TRANSLITERATION
A KERNEL PCA METHOD FOR SUPERIOR WORD SENSE DISAMBIGUATION
A MENTION-SYNCHRONOUS COREFERENCE RESOLUTION ALGORITHM BASED ON THE
BELL TREE
A SENTIMENTAL EDUCATION: SENTIMENT ANALYSIS USING SUBJECTIVITY
SUMMARIZATION
A STUDY ON CONVOLUTION KERNELS FOR SHALLOW SEMANTIC PARSING
A TAG-BASED NOISY CHANNELMODEL OF SPEECH REPAIRS
A UNIFIED FRAMEWORK FOR AUTOMATIC EVALUATION USING N-GRAM CO-OCCURRENCE
STATISTICS
ACQUIRING THE MEANING OF DISCOURSE MARKERS
ADAPTIVE CHINESE WORD SEGMENTATION
ALIGNING WORDS USING MATRIX FACTORISATION
ALTERNATIVE APPROACHES FOR GENERATING BODIES OF GRAMMAR RULES
AN ALTERNATIVE METHOD OF TRAINING PROBABILISTIC LR PARSERS
AN EMPIRICAL STUDY OF INFORMATION SYNTHESIS TASKS
ANALYSIS OF MIXED NATURAL AND SYMBOLIC LANGUAGE INPUT IN MATHEMATICAL
DIALOGS
ANNEALING TECHNIQUES FOR UNSUPERVISED STATISTICAL LANGUAGE LEARNING
APPLYING MACHINE LEARNING TO CHINESE TEMPORAL RELATION RESOLUTION
ATTENTION SHIFTING FOR PARSING SPEECH
AUTOMATIC EVALUATION OF MACHINE TRANSLATION QUALITY USING LONGEST
COMMON SUBSEQUENCE AND SKIP-BIGRAM STATISTICS
BALANCING CLARITY AND EFFICIENCY IN TYPED FEATURE LOGIC THROUGH
DELAYING
BUILDING VERB PREDICATES: A COMPUTATIONAL VIEW
CHINESE VERB SENSE DISCRIMINATION USING AN EM CLUSTERING MODEL WITH RICH LINGUISTIC FEATURES
CLASSIFYING SEMANTIC RELATIONS IN BIOSCIENCE TEXTS
COLLECTIVE INFORMATION EXTRACTION WITH RELATIONAL MARKOV NETWORKS
COLLOCATION TRANSLATION ACQUISITION USING MONOLINGUAL CORPORA
COMBINING ACOUSTIC AND PRAGMATIC FEATURES TO PREDICT RECOGNITION
PERFORMANCE IN SPOKEN DIALOGUE SYSTEMS
COMPUTING LOCALLY COHERENT DISCOURSES
CONSTRUCTIVIST DEVELOPMENT OF GROUNDED CONSTRUCTION GRAMMAR
CONVOLUTION KERNELS WITH FEATURE SELECTION FOR NATURAL LANGUAGE
PROCESSING TASKS
CORPUS-BASED INDUCTION OF SYNTACTIC STRUCTURE: MODELS OF DEPENDENCY AND
CONSTITUENCY
CREATING MULTILINGUAL TRANSLATION LEXICONS WITH REGIONAL VARIATIONS
USING WEB CORPORA
DATA-DRIVEN STRATEGIES FOR AN AUTOMATED DIALOGUE SYSTEM
DEPENDENCY TREE KERNELS FOR RELATION EXTRACTION
DEVELOPING A FLEXIBLE SPOKEN DIALOG SYSTEM USING SIMULATION
DIMENSIONS OF PARSING
DISCOVERING RELATIONS AMONG NAMED ENTITIES FROM LARGE CORPORA
DISCRIMINATIVE LANGUAGE MODELING WITH CONDITIONAL RANDOM FIELDS AND THE
PERCEPTRON ALGORITHM
DISCRIMINATIVE TRAINING OF A NEURAL NETWORK STATISTICAL PARSER
ENRICHING THE OUTPUT OF A PARSER USING MEMORY-BASED LEARNING
ERROR MINING FOR WIDE-COVERAGE GRAMMAR ENGINEERING
EVALUATING CENTERING-BASED METRICS OF COHERENCE FOR TEXT STRUCTURING
USING A RELIABLY ANNOTATED CORPUS
EXPERIMENTS IN PARALLEL-TEXT BASED GRAMMAR INDUCTION
EXTENDING BLEU MT EVALUATION METHOD WITH FREQUENCY WEIGHING
EXTRACTING REGULATORY GENE EXPRESSION NETWORKS FROM PUBMED
FINDING IDEOGRAPHIC REPRESENTATIONS OF JAPANESE NAMES WRITTEN IN LATIN
SCRIPT VIA LANGUAGE IDENTIFICATION AND CORPUS VALIDATION
FINDING PREDOMINANT WORD SENSES IN UNTAGGED TEXT
FLSA: EXTENDING LATENT SEMANTIC ANALYSIS WITH FEATURES\\ FOR DIALOGUE
ACT CLASSIFICATION
FSA: AN EFFICIENT AND FLEXIBLE C++ TOOLKIT FOR FINITE STATE AUTOMATA
USING ON-DEMAND COMPUTATION
GENERALIZED MULTITEXT GRAMMARS
GENERATING REFERRING EXPRESSIONS IN OPEN DOMAINS
HEAD-DRIVING PARSING FOR WORD LATTICES
IDENTIFYING AGREEMENT AND DISAGREEMENT IN CONVERSATIONAL SPEECH: USE OF
BAYESIAN NETWORKS TO MODEL PRAGMATIC DEPENDENCIES
IMPROVING IBM WORD-ALIGNMENT MODEL 1
IMPROVING PRONOUN RESOLUTION BY INCORPORATING COREFERENTIAL INFORMATION
OF CANDIDATES
INCREMENTAL PARSING WITH THE PERCEPTRON ALGORITHM
INDUCING FRAME SEMANTIC VERB CLASSES FROM WORDNET AND LDOCE
LARGE-SCALE INDUCTION AND EVALUATION OF LEXICAL RESOURCES FROM THE
PENN-II TREEBANK
LEARNING NOUN PHRASE ANAPHORICITY TO IMPROVE COREFERENCE RESOLUTION:
ISSUES IN REPRESENTATION AND OPTIMIZATION
LEARNING TO RESOLVE BRIDGING REFERENCES
LEARNING WITH UNLABELED DATA FOR TEXT CATEGORIZATION USING A
BOOTSTRAPPING AND A FEATURE PROJECTION TECHNIQUE
LEARNING WORD SENSE WITH FEATURE SELECTION AND ORDER IDENTIFICATION
CAPABILITIES
LINGUISTIC PROFILING FOR AUTHORSHIP RENOGNITION AND VERIFICATION
LONG-DISTANCE DEPENDENCY RESOLUTION IN AUTOMATICALLY ACQUIRED
WIDE-COVERAGE PCFG-BASED LFG APPROXIMATIONS
MINIMAL RECURSION SEMANTICS AS DOMINANCE CONSTRAINTS: TRANSLATION,
EVALUATION, AND ANALYSIS
MINING METALINGUISTIC ACTIVITY IN CORPORA TO CREATE LEXICAL RESOURCES
USING INFORMATION EXTRACTION TECHNIQUES: THE MOP SYSTEM
MULTI-CRITERIA-BASED ACTIVE LEARNING FOR NAMED ENTITY RECOGNITION
MULTI-ENGINE MACHINE TRANSLATION WITH VOTED LANGUAGE MODEL
OPTIMIZATION IN MULTIMODAL INTERPRETATION
OPTIMIZING TYPED FEATURE STRUCTURE GRAMMAR PARSING THROUGH
NON-STATISTICAL INDEXING
PARAGRAPH-, WORD-, AND COHERENCE-BASED APPROACHES TO SENTENCE RANKING:
A COMPARISON OF ALGORITHM AND HUMAN PERFORMANCE
PARSING THE WSJ USING CCG AND LOG-LINEAR MODELS
PREDICATE-ARGUMENT STRUCTURE FROM BROAD-COVERAGE PARSE TREES: IMPROVING
ON THE CONTEXT-FREE APPROXIMATION
PREDICTING STUDENT EMOTIONS IN COMPUTER-HUMAN TUTORING DIALOGUES
PROBABILISTIC PARSING STRATEGIES
QUESTION ANSWERING USING CONSTRAINT SATISFACTION:
QA-BY-DOSSIER-WITH-CONSTRAINTS
RELIEVING THE DATA ACQUISITION BOTTLENECK IN WORD SENSE DISAMBIGUATION
SPLITTING COMPLEX TEMPORAL QUESTIONS FOR QUESTION ANSWERING SYSTEMS
STATISTICAL MACHINE TRANSLATION WITH WORD- AND SENTENCE-ALIGNED
PARALLEL CORPORA
STATISTICAL MODELING FOR UNIT SELECTION IN SPEECH SYNTHESIS
THE SENTIMENTAL FACTOR: IMPROVING REVIEW CLASSIFICATION VIA
HUMAN-PROVIDED INFORMATION
TRAINABLE SENTENCE PLANNING FOR COMPLEX INFORMATION PRESENTATION IN
SPOKEN DIALOG SYSTEMS
UNSUPERVISED LEARNING OF CHINESE VERB SENSES BY USING AN EM CLUSTERING
MODEL WITH RICH LINGUISTIC FEATURES
UNSUPERVISED SENSE DISAMBIGUATION USING BILINGUAL PROBABILISTIC MODELS
USER EXPERTISE MODELLING AND ADAPTIVITY IN A SPEECH-BASED E-MAIL SYSTEM
USING CONDITIONAL RANDOM FIELDS TO PREDICT PITCH ACCENTS IN
CONVERSATIONAL SPEECH
USING LINGUISTIC PRINCIPLES TO RECOVER EMPTY CATEGORIES
WEAKLY SUPERVISED LEARNING FOR CROSS-DOCUMENT PERSON NAME
DISAMBIGUATION SUPPORTED BY INFORMATION EXTRACTION
WRAPPING OF TREES |