Personal Details
|
| Date of birth |
May 22, 1971 |
| Place of birth |
Pleven, Bulgaria |
| Home address |
6 Gorski patnik Str., bl. 1A, ap.5, Sofia 1421 |
| Home telephone number |
(+3592) 865-80-62 |
| E-mail |
petya@bultreebank.org |
|
Education
|
| 1996-1999 |
PhD in Linguistics, Institute for
Bulgarian Language, Bulgarian Academy of Sciences.
Title of the PhD Thesis: "Semantics and functioning of Indefinite pronouns in
Bulgarian" (abstract in Bulgarian) |
| 1989-1995 |
Sofia University "St. Kliment
Ohridski": MA in Czech language and literature (1995), MA in Bulgarian language
(1995), minor in English language and literature (1995) |
| 1985-1989 |
Mathematical Secondary School, Pleven |
|
Scholarships
|
| 1992-1993 |
One-year scholarship from the Open Society Foundation, Bulgaria |
| 1993 |
One-year scholarship from the National Academic Foundation, Bulgaria |
|
Additional Courses and Workshops
|
| 14 July 2005 |
Workshop on Exploring Syntactically Annotated Corpora, Birmingham,
UK (co-organizer) |
| 9-20 April 2004 |
ESSLLI 2004, Nancy, France (participant) |
| 13-14 April 2003 |
Workshop on Linguistically Interpreted Corpora (LINC4), Budapest, Hungary (participant) |
| 27 March 2003 |
Workshop on Shallow Processing of Large Corpora (SProLaC2003), Lancaster University, England (co-organizer) |
| 24-26 Sept. 2002 |
International Workshop on Electronic Description and Edition of Slavic Resources, 24-26 Sept., Pomorie, Bulgaria (participant) |
| 20-21 Sept 2002 |
The First Workshop on Treebanks and Linguistic Theories (TLT2002), Sozopol, Bulgaria (one of the local organizers) |
| 8-19 Sept 2002 |
Fall School on Empirical Linguistics and
Natural Language Processing, Sozopol, Bulgaria |
| 5-6 Aug 2002 |
ESSLLI Workshop on Machine Learning Approaches in Computational Linguistics, Trento, Italy, 5-16 Aug (participant) |
| Jan-March 2002 |
Specialization at the Department of Computational Linguistics (SfS),
Tuebingen University |
| 15 - 20 October 2001 |
How to use corpora in language teaching, Tuscan Word Center, Italy |
| 16 - 27 July 2001 |
ELSNET summer school on Corpus Linguistics, Prague, Czech Republic |
| 25 Aug - 8 Sept 2000 |
Summer School on Computational Linguistics and Represented Knowledge, CLaRK Programme, Sozopol, Bulgaria |
| 20 - 31 March 2000 |
Vilem Mathesius Lecture Series 15, Prague, Czech Republic |
| 13 - 19 Sept. 1998 |
Summer School "Contacts in Science", Pamporovo, Bulgaria,
organized by the Swedish Council for Research and the Bulgarian Academy of Sciences |
| March - June 1995 |
Scholarship at Charles University, Prague, Czech Republic |
| July - August 1993 |
Summer School in Olomouc, Czech Republic |
|
Experience
|
| 1st February 2001 - present |
Postdoctoral researcher at the BulTreeBank
Project, LML, Bulgarian
Academy of Sciences |
| January 2004 - December 2004 |
Postdoctoral researcher at the Measuring Language Contact
Project, Groningen, the Netherlands |
| 1st May, 2000 - present |
Assistant Professor in morphology and syntax at the Division of Bulgarian
Language, Faculty of
Slavonic languages, St. Kl. Ohridski University |
| July 1999 - April 2000 |
Research Fellow, Section for Bulgarian lexicology and lexicography, Bulgarian Academy of Sciences |
| 1999-2000 |
Part-time Lecturer in Bulgarian syntax at the Division of Bulgarian
Language, Faculty of
Slavonic languages, St. Kl. Ohridski University |
| 1997-1998 |
Part-time Lecturer in Czech language at the Faculty of Slavonic languages,
St. Kl. Ohridski University |
|
Program Committees
|
| 2005 |
RANLP 2005 Workshop "Language and Speech Infrastructure for Information Access in the Balkan Countries", Borovets, Bulgaria |
| 2005 |
TLT 2005 Barcelona, Spain |
| 2005 |
ESSLLI 2005 Student Workshop, Edinburgh, UK |
| 2004 |
ESSLLI 2004 Workshop on Combining Shallow and Deep Processing for NLP, Nancy, France |
| 2003 |
ACL 2003 Student Research Workshop |
| 2003 |
Second Workshop on Treebanks and Linguistic Theories, Sweden, November 2003 |
| 2003 |
Workshop IESL-2003 on Information Extraction for Slavonic and other Central and Eastern European Languages, Borovets, Bulgaria |
| 2003 |
Workshop on Shallow Processing of Large Corpora (SProLaC2003), Lancaster University, England |
| 2002 |
First Workshop on Treebanks and Linguistic Theories, Bulgaria, September 2002 |
|
Teaching
|
| 2003 - 2004 |
MA Course on Syntactic Models (HPSG), Faculty of Classic and New
Philologies,
St. Kl. Ohridski University |
| Every summer semester |
Seminar on Computational Corpus Linguistics at the Faculty of Slavonic languages,
St. Kl. Ohridski University (together with Kiril
Simov, BulTreeBank, LML, Bulgarian Academy of
Sciences, and Krasimira Alexova, St. Kl. Ohridski
University) |
| 1999 - present |
Course on Bulgarian Syntax at the Faculty of Slavonic languages,
St. Kl. Ohridski University |
| 1999 - present |
Course on Bulgarian Morphology and Syntax at the Faculty of Slavonic languages,
St. Kl. Ohridski University |
|
Projects
|
| Question Answering for Bulgarian, CLEF 2005
|
I participated actively in the preparation of the Bulgarian corpus and of the
Bulgarian questions and topics. I was also involved in the assessment of other
groups' topics and questions against the Bulgarian corpus.
|
| February 1, 2001 – present: BulTreeBank
Project |
I am a postdoctoral researcher in the BulTreeBank
Project (an HPSG-based treebank of Bulgarian), a joint project between the Linguistic Modelling Laboratory, Bulgarian Academy of Sciences, and SfS, Tuebingen, Germany. |
| January 2004 – December 2004: Measuring Language Contact
Project |
I was a postdoctoral researcher at Alfa-Informatika, Groningen University.
I worked on measuring distances between Bulgarian dialects at the phonetic
level. I collaborated with Prof. John Nerbonne, Wilbert Heeringa, and Peter Kleiweg.
|
| October, 1999 - October, 2000: "Sentence corpus for morpho-syntactic
disambiguation of Bulgarian" within the CLaRK Programme |
I selected sentences from real texts that demonstrate the most frequent
morpho-syntactic ambiguities remaining after automatic morphological
analysis of Bulgarian texts. The selected sentences were marked up with morpho-syntactic
information and the correct analysis was indicated. I am responsible for the selection and
for the manual disambiguation. At the moment the corpus consists of about 2,500 sentences
marked up with part-of-speech information. A new set of sentences is being compiled. |
| August, 1999 - present: "Computer-based inflectional morphological
dictionary of Bulgarian" |
Together with Kiril Simov and Svetlomira Vidinska, I am working on
extending the computer version of the Bulgarian inflectional morphological dictionary.
The main goal of the project is to extend the vocabulary to cover the paper and electronic
archives at the Institute of Bulgarian Language, Bulgarian Academy of Sciences. My responsibility is to
classify the new lexemes according to the morphological classes. The next task will be to
refine the classification if new morphological classes are introduced. |