Information Retrieval
Medical Information Retrieval and Interpretation: A Question-Answer based Interaction Model
Sinhababu, Nilanjan, Saxena, Rahul, Sarma, Monalisa, Samanta, Debasis
The Internet has become a very powerful platform where diverse medical information are expressed daily. Recently, a huge growth is seen in searches like symptoms, diseases, medicines, and many other health related queries around the globe. The search engines typically populate the result by using the single query provided by the user and hence reaching to the final result may require a lot of manual filtering from the user's end. Current search engines and recommendation systems still lack real time interactions that may provide more precise result generation. This paper proposes an intelligent and interactive system tied up with the vast medical big data repository on the web and illustrates its potential in finding medical information.
Google is threatening to pull its search engine out of Australia
Google and Facebook have been in a long-running fight with Australian politicians, regulators and media companies over whether they should pay news organizations for showing their stories in search results. The battle reached a new level of intensity when a Google executive threatened to pull out of the country during testimony at the Australian Senate.
Google's threat to withdraw its search engine from Australia is chilling to anyone who cares about democracy Peter Lewis
Google's testimony to an Australian Senate committee on Friday threatening to withdraw its search services from Australia is chilling to anyone who cares about democracy. It marks the latest escalation in the globally significant effort to regulate the way the big tech platforms use news content to drive their advertising businesses and the catastrophic impact on the news media across the world. The news bargaining code, which would require Google and Facebook to negotiate a fair price for the use of news content, is the product of an 18-month process driven by the competition regulator. That legislation is currently before the Australian parliament, where a Senate committee is taking final submissions from interested parties. The Google bombshell makes explicit what has been a slowly escalating threat that a binding code would not be tenable.
privacy?
DuckDuckGo, a search engine focused on privacy, increased its average number of daily searches by 62% in 2020 as users seek alternatives to impede data tracking. The search engine, founded in 2008, operated nearly 23.7 billion search queries on their platform in 2020, according to their traffic page. On Jan. 11, DuckDuckGo reached its highest number of search queries in one day, with a total of 102,251,307. DuckDuckGo does not track user searches or share personal data with third-party companies. "People are coming to us because they want more privacy, and it's generally spreading through word of mouth," Kamyl Bazbaz, DuckDuckGo vice president of communications, told USA TODAY.
DuckDuckGo search engine increased its traffic by 62% in 2020 as users seek privacy
DuckDuckGo, a search engine focused on privacy, increased its average number of daily searches by 62% in 2020 as users seek alternatives to impede data tracking. The search engine, founded in 2008, operated nearly 23.7 billion search queries on their platform in 2020, according to their traffic page. On Jan. 11, DuckDuckGo reached its highest number of search queries in one day, with a total of 102,251,307. DuckDuckGo does not track user searches or share personal data with third-party companies. "People are coming to us because they want more privacy, and it's generally spreading through word-of-mouth," Kamyl Bazbaz, DuckDuckGo vice president of communications, told USA TODAY.
ExpFinder: An Ensemble Expert Finding Model Integrating $N$-gram Vector Space Model and $\mu$CO-HITS
Kang, Yong-Bin, Du, Hung, Forkan, Abdur Rahim Mohammad, Jayaraman, Prem Prakash, Aryani, Amir, Sellis, Timos
Finding an expert plays a crucial role in driving successful collaborations and speeding up high-quality research development and innovations. However, the rapid growth of scientific publications and digital expertise data makes identifying the right experts a challenging problem. Existing approaches for finding experts given a topic can be categorised into information retrieval techniques based on vector space models, document language models, and graph-based models. In this paper, we propose $\textit{ExpFinder}$, a new ensemble model for expert finding, that integrates a novel $N$-gram vector space model, denoted as $n$VSM, and a graph-based model, denoted as $\textit{$\mu$CO-HITS}$, that is a proposed variation of the CO-HITS algorithm. The key of $n$VSM is to exploit recent inverse document frequency weighting method for $N$-gram words and $\textit{ExpFinder}$ incorporates $n$VSM into $\textit{$\mu$CO-HITS}$ to achieve expert finding. We comprehensively evaluate $\textit{ExpFinder}$ on four different datasets from the academic domains in comparison with six different expert finding models. The evaluation results show that $\textit{ExpFinder}$ is a highly effective model for expert finding, substantially outperforming all the compared models in 19% to 160.2%.
Multistage BiCross Encoder: Team GATE Entry for MLIA Multilingual Semantic Search Task 2
Singh, Iknoor, Scarton, Carolina, Bontcheva, Kalina
The Coronavirus (COVID-19) pandemic has led to a rapidly growing `infodemic' online. Thus, the accurate retrieval of reliable relevant data from millions of documents about COVID-19 has become urgently needed for the general public as well as for other stakeholders. The COVID-19 Multilingual Information Access (MLIA) initiative is a joint effort to ameliorate exchange of COVID-19 related information by developing applications and services through research and community participation. In this work, we present a search system called Multistage BiCross Encoder, developed by team GATE for the MLIA task 2 Multilingual Semantic Search. Multistage BiCross-Encoder is a sequential three stage pipeline which uses the Okapi BM25 algorithm and a transformer based bi-encoder and cross-encoder to effectively rank the documents with respect to the query. The results of round 1 show that our models achieve state-of-the-art performance for all ranking metrics for both monolingual and bilingual runs.
Quantum Mathematics in Artificial Intelligence
Widdows, Dominic, Kitto, Kirsty, Cohen, Trevor
In the decade since 2010, successes in artificial intelligence have been at the forefront of computer science and technology, and vector space models have solidified a position at the forefront of artificial intelligence. At the same time, quantum computers have become much more powerful, and announcements of major advances are frequently in the news. The mathematical techniques underlying both these areas have more in common than is sometimes realized. Vector spaces took a position at the axiomatic heart of quantum mechanics in the 1930s, and this adoption was a key motivation for the derivation of logic and probability from the linear geometry of vector spaces. Quantum interactions between particles are modelled using the tensor product, which is also used to express objects and operations in artificial neural networks. This paper describes some of these common mathematical areas, including examples of how they are used in artificial intelligence (AI), particularly in automated reasoning and natural language processing (NLP). Techniques discussed include vector spaces, scalar products, subspaces and implication, orthogonal projection and negation, dual vectors, density matrices, positive operators, and tensor products. Application areas include information retrieval, categorization and implication, modelling word-senses and disambiguation, inference in knowledge bases, and semantic composition. Some of these approaches can potentially be implemented on quantum hardware. Many of the practical steps in this implementation are in early stages, and some are already realized. Explaining some of the common mathematical tools can help researchers in both AI and quantum computing further exploit these overlaps, recognizing and exploring new directions along the way.
Model Generalization on COVID-19 Fake News Detection
Bang, Yejin, Ishii, Etsuko, Cahyawijaya, Samuel, Ji, Ziwei, Fung, Pascale
Amid the pandemic COVID-19, the world is facing unprecedented infodemic with the proliferation of both fake and real information. Considering the problematic consequences that the COVID-19 fake-news have brought, the scientific community has put effort to tackle it. To contribute to this fight against the infodemic, we aim to achieve a robust model for the COVID-19 fake-news detection task proposed at CONSTRAINT 2021 (FakeNews-19) by taking two separate approaches: 1) fine-tuning transformers based language models with robust loss functions and 2) removing harmful training instances through influence calculation. We further evaluate the robustness of our models by evaluating on different COVID-19 misinformation test set (Tweets-19) to understand model generalization ability. With the first approach, we achieve 98.13% for weighted F1 score (W-F1) for the shared task, whereas 38.18% W-F1 on the Tweets-19 highest. On the contrary, by performing influence data cleansing, our model with 99% cleansing percentage can achieve 54.33% W-F1 score on Tweets-19 with a trade-off. By evaluating our models on two COVID-19 fake-news test sets, we suggest the importance of model generalization ability in this task to step forward to tackle the COVID-19 fake-news problem in online social media platforms.