Not enough data to create a plot.
Try a different view from the menu above.
Ozertem, Umut
Preserving Integrity in Online Social Networks
Halevy, Alon, Ferrer, Cristian Canton, Ma, Hao, Ozertem, Umut, Pantel, Patrick, Saeidi, Marzieh, Silvestri, Fabrizio, Stoyanov, Ves
Online social networks provide a platform for sharing information and free expression. However, these networks are also used for malicious purposes, such as distributing misinformation and hate speech, selling illegal drugs, and coordinating sex trafficking or child exploitation. This paper surveys the state of the art in keeping online platforms and their users safe from such harm, also known as the problem of preserving integrity. This survey comes from the perspective of having to combat a broad spectrum of integrity violations at Facebook. We highlight the techniques that have been proven useful in practice and that deserve additional attention from the academic community. Instead of discussing the many individual violation types, we identify key aspects of the social-media eco-system, each of which is common to a wide variety violation types. Furthermore, each of these components represents an area for research and development, and the innovations that are found can be applied widely.
Enhanced End-of-Turn Detection for Speech to a Personal Assistant
Arsikere, Harish (Xerox Research Center India) | Shriberg, Elizabeth (SRI International) | Ozertem, Umut (Microsoft)
Speech to personal assistants (e.g., reminders, calendar entries, messaging, voice search) is often uttered under cognitive load, causing nonfinal pausing that can result in premature recognition cut-offs. Prior research suggests that prepausal features can discriminate final from nonfinal pauses, but it does not reveal how speakers would behave if given longer to pause. To this end, we collected and compared two elicitation corpora differing in naturalness and task complexity. The Template Corpus (4409 nonfinal pauses) uses keyword-based prompts; the Freeform Corpus (8061 nonfinal pauses) elicits open-ended speech. While nonfinal pauses are longer and twice as frequent in the Freeform data, prepausal feature modelling is roughly equally effective in both corpora. At a response latency of 100 ms, prepausal features modelled by an SVM reduced cut-off rates from 100% to 20% for both corpora. Results have implications for enhancing turn-taking efficiency and naturalness in personal-assistant technology.