Home
Description
Publications

Available Resources
Text Acknowledgements
Related links


Events


CLaRK System

CLaRK System Online Manual


Bulgarian dialects'
electronic archive




eXTReMe Tracker

 

 

 

 

 

 

 


Publications of the BulTreeBank Project


A pre-project publication

Paul John King and Kiril Simov. The automatic deduction of classificatory systems from linguistic theories.
This paper was published in two versions:
Paul John King and Kiril Simov. The Automatic Deduction of Classificatory Systems from Linguistic Theories. (Abridged) In Christian Retore (editor), Proceedings of Logical Aspects of Computational Linguistics. Lecture Notes in Artificial Intelligence 1328, pages 248-273. Springer-Verlag, Berlin, Germany. 1997. 26 pages. Zipped Postscript version
Paul John King and Kiril Simov. The Automatic Deduction of Classificatory Systems from Linguistic Theories. (Revised version). Grammars, 1(2): 103-153. Kluwer Academic Publishers, The Netherlands. 1998. Here you could download a draft of this version (33 pages): Postscript version, Zipped Postscript version.

Technical Reports

Publications in Bulgarian


2005


The Proceedings of the Exploring Syntactically Annotated Corpora Workshop

2004


Kiril Simov and Petya Osenova. A Treebank-Driven Approach to Semantic Lexicons Creation. In: Proceedings of TLT04,Tuebingen, Germany. 2004.


The Proceedings of the ESSLLI 2004 Workshop on Combining Shallow and Deep Processing for NLP


Kiril Simov, Alexander Simov, Petya Osenova. An XML Architecture for Shallow and Deep Processing. In: Proceedings of the ESSLLI 2004 Workshop on Combining Shallow and Deep Processing for NLP. 2004. pages 51-60.


Kiril Simov and Petya Osenova. A Hybrid Strategy for Regular Grammar Parsing. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 431-434.


Kiril Simov, Petya Osenova, Sia Kolkovska, Elisaveta Balabanova, Dimitar Doikoff. A Language Resources Infrastructure for Bulgarian. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 1685-1688.


Božo Bekavac, Petya Osenova, Kiril Simov, Marko Tadić. Making Monolingual Corpora Comparable: a Case Study of Bulgarian and Croatian. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 1187-1190.


Kiril Simov, Alexander Simov, Hristo Ganev, Krasimira Ivanova, Ilko Grigorov. The CLaRK System: XML-based Corpora Development System for Rapid Prototyping. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 235-238.


Tylman Ule, Kiril Simov. Unexpected Productions May Well be Errors. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 1795-1798.


Kiril Simov, Petya Osenova, Alexander Simov, Krasimira Ivanova, Ilko Grigorov, Hristo Ganev. Creation of a Tagged Corpus for Less-Processed Languages with CLaRK System. In: Proceedings of SALTMIL Workshop at LREC 2004: First Steps in Language Documentation for Minority Languages, Lisbon, Portugal. 2004. pages 80-83.


2003


Kiril Simov, Petya Osenova, Sia Kolkovska, Elisaveta Balabanova, Dimitar Doikoff. Language resources for the creation of a Bulgarian Treebank In: Workshop on Balkan Language Resources and Tools, 21 November 2003, Thessaloniki, Greece (satellite event to the Balkan Conference on Informatics - BCI 2003). 2003.


Kiril Simov, Alexander Simov, Krassimira Ivanova, Ilko Grigorov, Hristo Ganev. The CLARK System Tools XML based Corpora development In: Workshop on Balkan Language Resources and Tools, 21 November 2003, Thessaloniki, Greece (satellite event to the Balkan Conference on Informatics - BCI 2003). 2003.


Petya Osenova and Kiril Simov. The Bulgarian HPSG Treebank: Specialization of the Annotation Scheme. In: Proc. of The Second Workshop on Treebanks and Linguistic Theories (TLT2003), 14-15 November 2003, Växjö, Sweden.


Kiril Simov. HPSG-Based Annotation Scheme for Corpora Development and Parsing Evaluation. In: Proc. of the RANLP 2003 Conference, Borovets, Bulgaria, 10-12 September 2003. pages 432-439.


Kiril Simov and Petya Osenova. Practical Annotation Scheme for an HPSG Treebank of Bulgarian. In: Proc. of the 4th International Workshop on Linguistically Interpreteted Corpora (LINC-2003), Budapest, Hungary. 2003.


Kiril Simov, Alexander Simov, Milen Kouylekov, Krasimira Ivanova, Ilko Grigorov, Hristo Ganev. Development of Corpora within the CLaRK System: The BulTreeBank Project Experience. In: Proc. of the Demo Sessions of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL'03), Budapest, Hungary. 2003.


Tomaz Erjavec, Cvetana Krstev, Kiril Simov, Marko Tadic, Dusko Vitas. The MULTEXT-East Morphosyntactic Specifications for Slavic Languages. In: Proc. of the Workshop on Morphological Processing of Slavic Languages at EACL-2003, Budapest, Hungary. 2003.


The Proceedings of the Shallow Processing of Large Corpora (SProLaC 2003) Workshop


Petya Osenova and Kiril Simov Between Chunk Ideology and Full Parsing Needs In: Proceedings of the Shallow Processing of Large Corpora (SProLaC 2003) Workshop, Lancaster, UK. pages: 78-87.


Kiril Simov, Alexander Simov, Milen Kouylekov. Constraints for Corpora Development and Validation. In: Proc. of the Corpus Linguistics 2003 Conference, pages: 698-705.


2002


Kiril Simov, Petya Osenova, Sia Kolkovska, Elisaveta Balabanova, Dimitar Doikoff, Krassimira Ivanova, Alexander Simov, Milen Kouylekov. Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank. In: Proceedings of LREC 2002, Canary Islands, Spain. 2002. pages 1729-1736. (Zipped Postscript version)


Kiril Simov, Milen Kouylekov, Alexander Simov. Incremental Specialization of an HPSG-Based Annotation Scheme. In: Proceedings of LREC 2002 Workshop on "Linguistic Knowledge Acquisition and Representation: Bootstrapping Annotated Language Data", Canary Islands, Spain. 2002. pages 16-23. (Zipped Postscript version, Zipped PDF version, PDF version)


Kiril Simov. Grammar Extraction and Refinement from an HPSG Corpus. In: Proc. of the ESSLLI Workshop on Machine Learning Approaches in Computational Linguistics, Trento, Italy. August 5-16, 2002. pages 38-55. (Zipped Postscript version).


Petya Osenova and Kiril Simov. Learning a token classification from a large corpus. (A case study in abbreviations). In: Proc. of the ESSLLI Workshop on Machine Learning Approaches in Computational Linguistics, Trento, Italy. August 5-16, 2002. pages 16-28. (Zipped Postscript version).


Petya Osenova and Kiril Simov. Bulgarian Vocative within HPSG framework. In: Proc. of the 9th International Conference on Head-Driven Phrase Structure Grammar (HPSG), Kyung Hee University, Seoul, South Korea. August 8-9, 2002. pages 94-100. (Postscript version, Zipped Postscript version, Zipped PDF version).


Kiril Simov, Milen Kouylekov, Alexander Simov. Cascaded Regular Grammars over XML Documents. In: Proc. of the 2nd Workshop on NLP and XML (NLPXML-2002), Taipei, Taiwan. September 1, 2002. pages 51-58. (Postscript version, Zipped Postscript version, Zipped PDF version).


Elisaveta Balabanova and Krassimira Ivanova. Creating a machine-readable version of Bulgarian valence dictionary: (A case study of CLaRK system application). In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 1-12.


Krassimira Ivanova and Dimitar Doikoff. Cascaded Regular Grammars and Constraints over Morphologically Annotated Data for Ambiguity Resolution. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 96-113.


Petya Osenova. Bulgarian Nominal Chunks and Mapping Strategies for Deeper Syntactic Analyses. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 150-166.


Petya Osenova and Sia Kolkovska. Combining the named-entity recognition task and NP chunking strategy for robust pre-processing. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 167-182.


Kiril Simov, Alexander Simov, Milen Kouylekov, Krassimira Ivanova. CLaRK System: Construction of Treebanks. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 183-198.


Milena Slavcheva. Segmentation Layers in the Group of the Predicate: a Case Study of Bulgarian within the BulTreeBank Framework. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 199-210.


The Proceedings of the Treebanks and Linguistic Theories 2002 Workshop

2001


Kiril Simov, Zdravko Peev, Milen Kouylekov, Alexander Simov, Marin Dimitrov, Atanas Kiryakov. CLaRK - an XML-based System for Corpora Development. In: Proc. of the Corpus Linguistics 2001 Conference, pages: 558-560. Zipped PDF version


Kiril Simov, Gergana Popova, Petya Osenova. HPSG-based syntactic treebank of Bulgarian (BulTreeBank). In: Proc. of the Corpus Linguistics 2001 Conference, 561. (abstract)

Kiril Simov, Gergana Popova, Petya Osenova. HPSG-based syntactic treebank of Bulgarian (BulTreeBank). In: "A Rainbow of Corpora: Corpus Linguistics and the Languages of the World", edited by Andrew Wilson, Paul Rayson, and Tony McEnery; Lincom-Europa, Munich 2002. (full version) pages 135-142


Milena Slavcheva. Review of Cann, Ronnie, Claire Grover and Philip Miller, ed. (2000) Grammatical Interfaces in HPSG Linguist List: Vol-12-1900, 12.1900.


Kiril Simov, Petya Osenova. A Hybrid System for MorphoSyntactic Disambiguation in Bulgarian. In: Proc. of the RANLP 2001 Conference, Tzigov Chark, Bulgaria, 5-7 September 2001. pages 288-290 (Postscript version, Zipped Postscript version, Zipped PDF version) A full version of the paper can be found here: (Postscript version, Zipped Postscript version) and (PDF version, Zipped PDF version).


Kiril Simov. Grammar Extraction from an HPSG Corpus. In: Proc. of the RANLP 2001 Conference, Tzigov Chark, Bulgaria, 5-7 September 2001. pages 285-287 (Postscript version, Zipped Postscript version, Zipped PDF version).


Petya Osenova and Kiril Simov. Review of Minnen, Efficient Processing with Constraint-Logic Grammars Using Grammar Compilation LINGUIST List: Vol-12-3097. Sat Dec 15 2001.


Petya Osenova. On Subject-Verb Agreement in Bulgarian (An HPSG-based account). In: Proc. of the fourth Formal Description of Slavic Languages Conference, Potsdam, Germany, 2004. pages 661-672.