Segment-statistical approach to the Internet as a corpus: a new workshop in the ABBYY Open series

    We continue the ABBYY Open series of workshops on computational linguistics. The next event will be held on January 31 at 17:00 in the Moscow office of ABBYY. The topic is “Segment-statistical approach to the Internet as a corpus (using the blogosphere as an example).” The speaker is Vladimir Belikov, Doctor of Philology, Associate Professor of the Department of Theoretical and Applied Linguistics, Faculty of Philology of Moscow State University, and Leading Researcher at the Russian Language Institute of the Russian Academy of Sciences.

    His talk is about smart methods for extracting reliable linguistic information from the Internet. It offers a comparative analysis of the Russian National Corpus and various Internet corpora as sources of information about different kinds of Russian lexical usage. Drawing on Russian explanatory dictionaries and individual linguistic studies, it analyzes typical errors and inaccuracies that result from ignoring modern corpus methods in lexicography.

    The talk also examines the segment structure of the Russian-language blogosphere and presents results of analyzing it with the segment-statistical method, covering both the synchronic state and the dynamics of change in nationwide and regional Russian vocabulary, phraseology, and grammar. The methodology of linguistically oriented search in the blogosphere, and ways of overcoming the difficulties it raises, are described in detail.

    For more information and registration, see the ABBYY Open page.

    Update: a video of the seminar is available here.
