Marco Baroni:
Education and Academic/Professional History
Education
- Ph.D. in Linguistics, University of California, Los Angeles, June
2000
Dissertation Title: Distributional cues in morpheme discovery:
A computational model and empirical evidence
Dissertation
Committee: Bruce Hayes (chair), Carson Schütze, Edward Stabler,
Donca Steriade, Jody Kreiman
- M.A. in Linguistics, University of California, Los
Angeles. December 1997
Thesis Title: The representation of prefixed
forms in the Italian lexicon: Evidence from the distribution of
intervocalic [s] and [z]
Thesis Committee: Bruce Hayes (chair),
Sun-Ah Jun, Carson Schütze, Donca Steriade
- Laurea in Linguistica ("110 e lode"), University of
Padua, Italy, April 1995
Thesis Title: La relazione tra struttura
segmentale e costituenza moraica [The relation between segmental
structure and moraic constituency]
Thesis Co-Chairs: Alberto Mioni
and Laura Vanelli
Work Experience
- November 2006 - present
Researcher (tenured position)
Center for Mind/Brain Sciences (CIMeC)
Deparment of Cognitive and Education Sciences (DISCoF)
School of Letters and Philosophy
Università di Trento
CIMeC website: http://www.cimec.unitn.it
- October 2002 - October 2006
Researcher (tenured position)
Dipartimento di Studi Interdisciplinari su Traduzione, Lingue e
Cultura (SITLEC)
Università di Bologna (Sede di
Forlì), Italy
SITLEC website: http://www.sitlec.unibo.it
- September 2001 - August 2002
Researcher (position funded by EU
R&D project FASTY)
Natural Language Processing Group
Austrian
Research Institute for Artificial Intelligence (ÖFAI)
Vienna,
Austria
ÖFAI NLP group website: http://www.ai.univie.ac.at/oefai/nlu/
FASTY project website: http://www.fortec.tuwien.ac.at/reha.e/projects/fasty/fasty.html
- July 2000 - August 2001
Computational Linguist
Language
Development Team / Core Technologies Team
Conversay
Redmond
WA, USA
Conversay website: http://www.conversay.com/
- January - December 1999
Research Assistant to Prof. Pat
Keating (position funded by NSF project KDI)
Phonetics
Laboratory
Department of Linguistics
University of California,
Los Angeles
Los Angeles CA, USA
KDI project website: http://www.hei.org/research/projects/comneur/kdipage.htm
- Summer 1998
Summer Research Intern
Spoken Language
Processes Laboratory
House Ear Institute
Los Angeles CA,
USA
Teaching
- Fall 2008 - Present
Linear Models in R module of the
Computational and Statistical Methods for Data modelling and
Analysis course
Doctoral Schools in Cognitive and Brain
Sciences and Psychological Sciences and Education, University of
Trento
- Fall 2007 - Present
Coordinator and co-instructor of the
Text Processing I course
Philosophy and Informatics Master, School of Letters and
Philosophy, University of Trento
International Cognitive Science Master, School of Cognitive
Science, University of Trento
Master in Human Language Technologies and Interfaces, University of
Trento
- Fall 2009 - Present
Perl Programming for Text Analysis,
Part II of Informatica per le Discipline Umanistiche e
Linguistiche
Philosophy and Informatics Bachelor, School of Letters and
Philosophy, University of Trento
- Winter 2007 - Winter 2009
Computational Lexicography lab of
Humanities Computing course
School of Letters and
Philosophy, University of Trento
- Fall 2007 - Winter 2009
Introduction to Perl for Text
Processing (Humanities Computing module)
School of Letters and
Philosophy, University of Trento
- Winter 2009
Co-coordinator and co-instructor of the the
Topic Seminar SExIE, Seminar on Extreme Information
Extraction
Doctoral School in Cognitive and Brain Sciences,
University of Trento
- Winter 2009
Lexicography module of Applied Linguistics
course
School of Letters and Philosophy, University of
Trento
- Winter 2008
Lexical Semantics module of General
Linguistics course
School of Letters and Philosophy,
University of Trento
- Fall 2007 - Winter 2008
Co-coordinator and co-instructor
of the Topic Seminar EviL, Evidence in Linguistics
Doctoral School in Cognitive and Brain Sciences, University of
Trento
- Winter 2007
Introduction to Corpora module of
Humanities Computing courses
School of Letters and Philosophy,
University of Trento
- Winter 2007
Collocations module of Applied Linguistics
course
School of Letters and Philosophy, University of
Trento
- Winter 2005 - Winter 2006
Automated Acquisition of Lexicon and
Terminology module of Terminology and Specialized Languages (I and
II).
SSLMIT, Università di Bologna
- Winter 2004 - Fall 2005
Computational Linguistics
SSLMIT, Università di Bologna
- Fall 2002 - Fall 2006
Phonetics/Phonology/Morphology
modules of General Linguistics course
SSLMIT,
Università di Bologna
- Fall 1996 - Fall 1998
Teaching Assistant for the courses
Introduction to Linguistics, Experimental Phonetics and
Introduction to General Phonetics
Department of Linguistics,
University of California, Los Angeles
Other Activities
- My team participated in the EVALITA 2009 Lexical Substitution track
- Taught mini-course on Distributional Semantics at
the GLIF center of the Universitat Pompeu Fabra,
Barcelona, June 2009
- In program committee of ESSLLI 2009, Bordeaux, July 2009
- Taught at the TRIPLE Winter School on The lexicon: analysis methods, models and applications, January 2009
- Co-organizer of
the ESSLLI
2008 Distributional Lexical Semantics Workshop,
Hamburg, August 2008
- Co-taught mini-course on Statistical programming in R for
computational linguists at
the Computational
Linguistics Fall School of the German Linguistics Association,
University of Potsdam, September 2007
- My team participated in
the EVALITA 2007 initiative in
the POS Tagging track: our system was a close second in the evaluation
- Co-organizer of
the quantitative/computational
curriculum of the Philosophy program of the University of
Trento
- Co-organizer of
the Contextual
Information in Semantic Space Models workshop at Context 07,
Roskilde University, August 2007
- Invited visiting scholar at the National Institute for Japanese
Language, Tokyo, Japan, July-August 2007
- Co-coordinator of
the CLEANEVAL
shared task on automated cleaning of Web data
- Co-organizer of
the LCT
Colloquia of the Universities of Bolzano and Trento
- Secretary
of SIGWAC, the Special
Interest Group on Web as Corpus of the Association for Computational
Linguistics
- Co-taught mini-course on Counting words: an introduction to
lexical statistics at ESSLLI 2006, Malaga, August 2006
- Taught mini-course Morphology and corpora: the case of
quantitative productivity at the University of Granada, May
2006
- Co-organizer of workshop on The Web as Corpus, EACL 2006,
Trento, April 2006
- Co-taught intensive mini-course Statistical Methods for Corpus
Exploitation at EURAC, Bolzano, October 2005
- Co-organizer of workshop on The Web as Corpus, Corpus
Linguistics 2005, Birmingham
(http://sslmit.unibo.it/~baroni/web_as_corpus_cl05.html),
July 2005
- Visiting scholar at the Austrian Research Institute for Artificial
Intelligence (ÖFAI), Vienna, Austria, May-August 2005
- Taught mini-course Statistics for Corpus Linguistics,
SITLEC, Forlì, April-May 2005
- Co-coordinator of the WaCky project (http://wacky.sslmit.unibo.it/)
- Co-organized workshop on The Web as Corpus, SSLMIT/SITLEC,
January 2005
- Administrator of the
site http://e-learning.sslmit.unibo.it/,
2003-2006
- Secretary of the entrance exam committee, SSLMIT, September 2003,
September 2004, September 2005.
- Co-organized and co-taught intensive mini-course A Practical
Introduction to Corpus Work, Bertinoro University Center, October
2003
- Co-coordinated the CORAL (CORpora e Apprendimento
Linguistico) e-learning project (http://www.e-learning.sslmit.unibo.it/COR/)
- Helped organizing the Interdepartmental Workshop on Science and
Common Sense, University of Padua, May 1995
- I contributed to the proposals of the following funded projects:
PAISA' (FIRB project, 2009-2011), LiveMemories (PAT project,
2008-2010), CompoNet (PRIN project, 2006-2008), LiMiNE (University of
Bologna strategic programme, 2007-2009)
- Reviewer for: Natural Language Engineering
(2009), Generative Lexicon (2009), CogSci Distributional
Semantics beyond Concrete Concepts Workshop (2009), ACL
(2008-2009), EACL GEMS Workshop (2009), EACL Cognitive
Aspects of Computational Language Acquisition Workshop
(2009), RANLP Workshop on NLP Methods and Corpora in Translation,
Lexicography, and Language Learning (2009), EACL
(2008), IEEE Intelligent Systems (2008), COLING
(2008), Human Judgments in Computational Linguistics Workshop at
COLING (2008), LREC (2008, 2010), IJCNLP (2008),
the ESSLLI Student Workshop (2008), Quantitative
Investigations in Theoretical Linguistics 3 (QITL-3)
(2008), Language Resources and Evaluation Journal
(2007-2008), Italian Journal of Linguistics
(2008), Cognitive Linguistics (2008), the UK Economic and
Social Research Council (ESRC) (2007), Europhysics Letters
(2007), Artificial Intelligence Journal (2007), WAC3
Workshop (2007), the US National Science Foundation (NSF)
(2005-2007), Morphology (2007), EMNLP-CoNLL07
(2007), AMML Workshop at RANLP 2007 (2007), Web Genres
Colloquim at Corpus Linguistics 2007 (2007),
Corpus linguistics: An international handbook
(2006), Languages in Contrast (2006), Quantitative
Investigations in Theoretical Linguistics 2 (QITL-2)
(2006), Journal of the International Phonetic Association
(2002, 2003), Phonetica (2001), Journal of the Acoustical
Society of America (2000),
Journal of Phonetics (1998)
Honors and Scholarships
- Invited Scholar Fellowship, the National Institute for Japanese
Language, Tokyo, 2007
- Marco Polo Scholarship for a research period abroad, University of
Bologna, 2005
- Chancellor Fellowship, University of California, Los Angeles,
1995-2000
- Summer School Fellowship, San Marino Center for Semiotic and
Cognitive Studies, 1995
- Education Abroad Program Fellowship, University of California, Los
Angeles, 1993-1994
- Summer School Fellowship, University of Bucharest, Romania,
1993
Back to Marco's Page.