The Internet as a building - a new workshop in the ABBYY Open series


    Next Tuesday, July 19, the next seminar in the ABBYY Open series “Actual Problems of Computer Linguistics” will be held at the Moscow office of ABBYY. The seminar will be delivered by Sergey Sharov, an employee of the Department of Translation at the University of Leeds (Great Britain), who previously worked at the Russian Research Institute of Artificial Intelligence and the Russian Language Institute, RAS. His report “Web as Corpus, Approaches to the Quantitative and Qualitative Analysis of Internet Text Content” is devoted to methods for collecting linguistic bodies on the Internet, assessing the quality of these methods, and considering approaches to automatic text classification.

    The seminar will describe how to quickly collect cases in the desired area, approaches to automatic classification of texts by subject area and genre using methods such as Support Vector Machines (SVM), Topic Modeling, Multidimensional Scaling. In addition to quantifying the quality of methods, it is also necessary to conduct a qualitative assessment of the conformity of the results of the classification of linguistic intuition. The workshop will provide examples of using methods for creating and processing cases for Russian, English, Chinese and German.

    You can read more about the event here . The seminar is free, for participation it is necessary to register and wait for confirmation of registration.

    UPD:Video from the workshop can be found here.

    Also popular now: