SEMINAR
In my talk I will briefly describe what CLARIN wants to achieve and what needs to be done in the participating countries to transform the present fragmented European landscape to a Schengen area for access to language resources and technology for the humanities and social sciences research community
Location: L308, Lennart torstenssonsgatan 8
SEMINAR
For more information, see the following publication: http://www.linguistics.ucsb.edu/faculty/stgries/research/2010_STG_BehavProf_TheMentalLexicon.pdf
Location: L308, Lennart torstenssonsgatan 8
SEMINAR
In this talk we will tell you more about Findwise as a company and how we design search solutions. We will focus on the collection and processing of the data as well as its presentation to the end user. Our focus is on bring most value to the customers give their structured and unstructured data.
Location: L308, Lennart torstenssonsgatan 8
SEMINAR
Frequency lists and frequency-based lexical resources contain information about the words and their statistics. They tend to find their "readers" among language learners, language teachers, linguists and lexicographers. Making them available in electronic format helps to expand the target group to cover language engineers, computer programmers and other specialists working in such areas as information retrieval, spam filtering, text readability analysis, test generation etc.
In this presentation we describe a new freely available electronic frequency list of modern Swedish which was created in the EU project KELLY. We provide a short description of the KELLY project; examine the methodological approach and provide some details on the compiling of the web-corpus SweWAC from which the list has been derived. Further, we discuss the type of information the list contains, describe the steps for list generation; provide information on the coverage and some other statistics over the items in the list. Finally, we suggest some potential application areas; and mention future plans. We hope that with some publicity we can help this list find its users.
Location: L308, Lennart torstenssonsgatan 8
SEMINAR
In January 2012 Karin Friberg Heppin and Ann-Marie Eklund participated in PROMISE winter school on Information Retrieval meets Information Visualization. In this presentation we will briefly describe the two main topics of the winter school, Information Retrieval and Information Visualization. We will also present a few highlights from the lectures.
http://www.promise-noe.eu/events/winter-school-2012
Location: L308, Lennart torstenssonsgatan 8
SEMINAR
Our aim is to detect whether texts written in natural language contain normative conflicts (i.e., whether there are conflicting obligations, permissions and prohibitions). In this talk we present AnaCon, a framework where such texts are written in Controlled Natural Language (CNL) and automatically translated into the formal language CL using the Grammatical Framework (GF). In AnaCon such CL expressions are analyzed for normative conflicts by the tool CLAN which gives a counter-example in case a conflict is found. Our framework uses GF to give a CNL version of the counter-example helping the user to identify the conflicts in the original text.
The talk will be divided in two parts. In the first part we will introduce AnaCon as well as relevant background on CL and CLAN. In the second part we will we show the usability of AnaCon on two case studies.
Location: EDIT room, EDIT building, Chalmers
SEMINAR
The talk reports recent work with Tom Kwiatkowski, Sharon Goldwater, and Luke Zettlemoyer on semantic parser induction by machine from a number of corpora pairing sentences with logical forms, including GeoQuery and a corpus consisting of real child-directed utterance from the CHILDES corpus.
The problem of semantic parser induction and child language acquisition are both similar to the problem of inducing a grammar and a parsing model from a treebank such as the Penn treebank, except that the trees are unordered logical forms, in which the preterminals are not aligned with words in the target language, and there may be noise and spurious distracting logical forms supported by the context but irrelevant to the utterance.
The talk shows that this class of problem can be solved if the child or machine initially parses with the entire space of possibilities that universal grammar allows under the assumptions of the Combinatory Categorial theory of grammar (CCG), and learns a statistical parsing model for that space using EM-related methods such as Variational Bayes learning.
This can be done without all-or-none "parameter-setting" or attendant "triggers", and without invoking any "subset principle" of the kind proposed in linguistic theory, provided the system is presented with a representative sample of reasonably short string-meaning pairs from the target language.
Location: L308, Lennart torstenssonsgatan 8
SEMINAR
Planning event for the CLT seminar series Spring semester
Location: L308, Lennart torstenssonsgatan 8