- Conference
- Program
- Speakers
- Price
- Location
KEYNOTE SPEAKER
Societal Challenges for Information Retrieval
Benno Stein
Biography
Benno Stein is chair of the Web-Technology and Information Systems Group at the Bauhaus-Universität Weimar. His research focuses on modeling and solving data- and knowledge-intensive information processing tasks. Common ground of his research are the principles and methods of symbolic Artificial Intelligence. Benno Stein has developed theories, algorithms, and tools for information retrieval, data mining, computational linguistics, knowledge processing, as well as for engineering design and simulation (patents granted). For several achievements of his research he has been awarded with scientific and commercial prizes.
Professional background: Study at the University of Karlsruhe (1984 - 1989). Dissertation (1995) and Habilitation (2002) in computer science at the University of Paderborn. Appointment as a full professor for Web Technology and Information Systems at the Bauhaus-Universität Weimar (2005). Research stays at IBM, Germany, and the International Computer Science Institute, Berkeley. Benno Stein serves on scientific boards, on program committees, as reviewer in various relevant conferences and journals, and he is the initiator and a co-chair of PAN, an excellence network and evaluation lab on digital text forensics with focus on authorship analysis, profiling, and reuse detection. He is cofounder and spokesperson of the Digital Bauhaus Lab Weimar, a visionary and interdisciplinary research center for Computer Science, Arts, and Engineering. Not least, he is a cofounder (1996) and scientific director of the Art Systems Software Ltd, a world leading company for simulation technology in fluidic engineering.
TUTORIAL
Automatic Text Simplification
Dr. Sanja Štajner
Biography
Sanja Štajner is currently a postdoctoral research fellow at the University of Mannheim, Germany. She holds a multiple Masters degree in Natural Language Processing and Human Language Technologies (Autonomous University of Barcelona, Spain and University of Wolverhampton, UK) and the PhD degree in Computer Science from the University of Wolverhampton on the topic of "Data-driven Text Simplification". She participated in Simplext and FIRST projects on automatic text simplification, and is the lead author of four ACL papers on text simplification (including the first neural text simplification system) and numerous other papers on the topics of text simplification and readability assessment at various leading international conferences and journals.
Sanja regularly teaches NLP at Masters and PhD levels, delivers invited talks and seminars at various universities and companies. She held a tutorial on "Deep Learning for Text Simplification" at RANLP 2017, and tutorial on "Data-Driven Text Simplification" at COLING 2018. She is an area chair for COLING 2018, and regular program committee member of ACL, EMNLP, LREC, IJCAI, IAAA and other international conferences and journals. She was a lead organizer of the first international workshop and shared task on Quality Assessment of Text Simplification (QATS) in 2016, and the Complex Word Identification shared task in 2018.
INDUSTRIAL SESSION
ML problems in crowdsoursing platform Yandex.Toloka.
Vladimir Kukushkin, Yandex
Crowdsourcing platform is a two-sided market where customers and workers look up each other. Customers place their tasks (usually related to machine learning purposes, such as collecting ground-truth labels for their datasets), workers do these tasks gaining money. As a platform, we should regulate their relationships effectively, increasing satisfaction of the both sides. Generally, there are many examples of two-sided markets: Uber, booking.com, AirBnB, etc. But crowdsourcing market has its own specific features: online job, low payments, low entry barrier for users and high entry barrier for customers and many others. The speech is devoted to machine learning problems we face with every day.