![]() Center for Machine Learning and Intelligent Systems |
About
Citation Policy
Donate a Data Set
Contact
View ALL Data Sets |
Source: Anna Glazkova, University of Tyumen, Russia. a.v.glazkova '@' utmn.ru Data Set Information: The corpus was created for the task of automatic search for fragments containing biographical information in a text in a natural language. The corpus includes 200 Russian biographical articles (Wikipedia, 2018).
Attribute Information: The corpus is a text collection, divided into sentences. Each sentence refers to one or two thematic classes: non-biographical fact (none); personal events (personal_events); professional events (professional_events); birth death nationality information about the parental family (parenting)); affiliation education family place of residence, residence (residence); occupation, position (occupation); other biographical facts (other).
Relevant Papers: Glazkova A.V. Automatic search for fragments containing biographical information in a natural language text. Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS). 2018;30(6):221-236. (In Russ.) [Web Link](6)-12
Citation Request: Glazkova A.V. Automatic search for fragments containing biographical information in a natural language text. Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS). 2018;30(6):221-236. (In Russ.) [Web Link](6)-12 |
Supported By: |
![]() |
In Collaboration With: |
![]() |