Andrew Gargett (Saarland University) will be visiting us 23-27/8. If you would like to organize time to meet with him contact Ellen Breitholtz (firstname.lastname@example.org).
In this talk, I will describe our research on spelling variation in Early Modern English (EmodE). Our original motivation was on retraining a semantic field annotation system (USAS, first designed for modern English) for historical corpora. Now, the research has taken a diversion into the detection of historical spelling variants as they cause significant problems for corpus-based computational linguistics techniques and tools. I will quantify the extent of the problem in a variety of EmodE corpora and show how it affects simple procedures such as key word comparisons. Our solution will be described, a corpus pre-processing tool called VARD (the Variant Detector), which uses techniques adapted from modern spell checkers. VARD offers modern equivalents for historical variants with high levels of precision and recall. The evaluation will focus on how much training data is required for such a system.
Location: room L307, Lennart Torstenssonsgatan 8
Ivan Sag, Stanford University, will visit Gothenburg on 22nd and 23rd September
Ivan Sag, Sadie Dernham Patek Professor in Humanities and Professor of Linguistics, Stanford University, Senior Researcher, Center for the Study of Language and Information (CSLI), will visit Gothenburg 22nd-23rd September. He will give two talks, both listed at CLT events.
Ivan A. Sag (PhD, MIT 1976) is the Sadie Dernham Patek Professor in
Humanities and Professor of Linguistics and Symbolic Systems at
Stanford University. A fellow of the American Academy of Arts and
Sciences and the Linguistic Society of America, in 2005 he received
the LSA's Victoria Fromkin Prize for distinguished contributions to
the field of linguistics.
Sag's research focuses on syntax, semantics, and language processing.
He is the author or coauthor of 10 books, as well as over 100 articles
in venues as diverse as Linguistics & Philosophy and Nature
Neuroscience. A member of the research teams that invented and
developed GPSG and HPSG, Sag's current research primarily concerns
constraint-based, lexicalist models of grammar, and their relation to
theories of language processing.
Director (2005-2009): Stanford's Symbolic Systems Program
(interdisciplinary program in cognitive and information science).
Senior Researcher: Stanford's Center for the Study of Language and
Professor Honoris Causa, University of Bucharest
Fellow: Center for Advanced Study in the Behavioral Studies
(2002-2003) Research School Logica - the Netherlands (1994),
Ameritch Fellow: University of Chicago (1987-88).
He began his linguistic career studying Indo-European and Sanskrit at
The University of Rochester (BA - 1971) and The University of
Pennsylvania (MA - 1973).
He also serves on the editorial boards of: Journal of Linguistics,
Linguistics and Philosophy, Research on Language and Computation,
Constructions, Semantics and Pragmatics.