Goto

Collaborating Authors

 Information Retrieval


Link the World: Improving Open-domain Conversation with Dynamic Spatiotemporal-aware Knowledge

arXiv.org Artificial Intelligence

Making chatbots world aware in a conversation like a human is a crucial challenge, where the world may contain dynamic knowledge and spatiotemporal state. Several recent advances have tried to link the dialog system to a static knowledge base or search engine, but they do not contain all the world information needed for conversations. In contrast, we propose a new method to improve the dialogue system using spatiotemporal aware dynamic knowledge. We utilize service information as a way for the dialogue system to link the world. The system actively builds a request according to the dialog context and spatiotemporal state to get service information and then generates world aware responses. To implement this method, we collect DuSinc, an open-domain human-human dialogue dataset, where a participant can access the service to get the information needed for dialogue responses. Through automatic and human evaluations, we found that service information significantly improves the consistency, informativeness, factuality, and engagingness of the dialogue system, making it behave more like a human. Compared to the pre-trained models without spatiotemporal aware dynamic knowledge, the overall session-level score was improved by 60.87\%. The collection dataset and methods will be open-sourced.


GitHub - qdrant/qdrant: Qdrant - Vector Search Engine for the next generation of AI applications

#artificialintelligence

Qdrant (read: quadrant) is a vector similarity search engine. It provides a production-ready service with a convenient API to store, search, and manage points - vectors with an additional payload. Qdrant is tailored to extended filtering support. It makes it useful for all sorts of neural-network or semantic-based matching, faceted search, and other applications. Qdrant is written in Rust, which makes it fast and reliable even under high load.


Generalizability of Code Clone Detection on CodeBERT

arXiv.org Artificial Intelligence

Transformer networks such as CodeBERT already achieve outstanding results for code clone detection in benchmark datasets, so one could assume that this task has already been solved. However, code clone detection is not a trivial task. Semantic code clones, in particular, are challenging to detect. We show that the generalizability of CodeBERT decreases by evaluating two different subsets of Java code clones from BigCloneBench. We observe a significant drop in F1 score when we evaluate different code snippets and functionality IDs than those used for model building.


Topic Detection in Continuous Sign Language Videos

arXiv.org Artificial Intelligence

Significant progress has been made recently on challenging tasks in automatic sign language understanding, such as sign language recognition, translation and production. However, these works have focused on datasets with relatively few samples, short recordings and limited vocabulary and signing space. In this work, we introduce the novel task of sign language topic detection. We base our experiments on How2Sign, a large-scale video dataset spanning multiple semantic domains. We provide strong baselines for the task of topic detection and present a comparison between different visual features commonly used in the domain of sign language.


RAGUEL: Recourse-Aware Group Unfairness Elimination

arXiv.org Artificial Intelligence

While machine learning and ranking-based systems are in widespread use for sensitive decision-making processes (e.g., determining job candidates, assigning credit scores), they are rife with concerns over unintended biases in their outcomes, which makes algorithmic fairness (e.g., demographic parity, equal opportunity) an objective of interest. 'Algorithmic recourse' offers feasible recovery actions to change unwanted outcomes through the modification of attributes. We introduce the notion of ranked group-level recourse fairness, and develop a 'recourse-aware ranking' solution that satisfies ranked recourse fairness constraints while minimizing the cost of suggested modifications. Our solution suggests interventions that can reorder the ranked list of database records and mitigate group-level unfairness; specifically, disproportionate representation of sub-groups and recourse cost imbalance. This re-ranking identifies the minimum modifications to data points, with these attribute modifications weighted according to their ease of recourse. We then present an efficient block-based extension that enables re-ranking at any granularity (e.g., multiple brackets of bank loan interest rates, multiple pages of search engine results). Evaluation on real datasets shows that, while existing methods may even exacerbate recourse unfairness, our solution -- RAGUEL -- significantly improves recourse-aware fairness. RAGUEL outperforms alternatives at improving recourse fairness, through a combined process of counterfactual generation and re-ranking, whilst remaining efficient for large-scale datasets.


Definition Of Search Engine - What Is A #SearchEngine #SEO #FrizeMedia

#artificialintelligence

We invite you to experience the distinctive style of Alisa Hotels Accra conference rooms and facilities designed to accommodate small to large events with a state of the art array of technology and catering services to make your event a total success. Would you prefer to share this page with others by linking to it? The most excellent way to explain the definition of search engine,is to say, it is a website or an online service that collects and organizes content from all over the internet. If you wish to locate information on the internet,you would enter a query about what it is that you are searching for,the search engine provides links to content and information that matches the query you are searching for. Search engines use powerful computer software that has the capability of searching through huge volumes of text or other data for specified keywords and then returning a list of files or documents where the keywords were found ranked in order of relevance. Search engines make life easier for users by tracking down massive on-line information on a wide variety of topics and are valuable on-line sources of secondary data.


Uses Of Statistics In Our Daily Life

#artificialintelligence

The world is getting data rich. Data is now a commodity for most businesses and a lot of this data is unstructured. This means that there is a need for Natural Language Processing (NLP). Natural language processing (NLP) is a branch of computer science, artificial intelligence, and computational linguistics concerned with the interactions between computers and human (natural) languages. Its goal is to enable computers to communicate with humans in a natural way, i.e. by using language, rather than simple strings of symbols. NLP includes the tasks of speech recognition, speech synthesis, document retrieval, understanding natural language, machine translation, and information retrieval.


SEO / PPC Tools To Grow Your Online Traffic

#artificialintelligence

We generate content ideas based on your keyword search. Register now and get more DATA! You can use our service to find new topics, keywords, and phrases that you might not have thought of before. This will help improve your SEO and provide new content ideas for your blog or eCommerce product page. The best part is that it's free to register!


Why is My Search Engine Yahoo?

#artificialintelligence

Why is my search engine Yahoo? It's a common question among webmasters, especially those who are just starting out in their internet endeavors. The reason is fairly simple: Yahoo is a very dominant search engine when it comes to searches. A lot of webmasters have grown dependent on the Search Marketing Working Group or SEM for backlinks and ranking. Therefore, if you want to make your page visible in Yahoo's result pages, it is important that you know how to increase your search engine positioning.


Machine Learning Engineer - NLP (Remote, Canada) - Remote Tech Jobs

#artificialintelligence

Level AI is a Mountain View, CA-based startup innovating in the Voice AI space. We are backed by top VCs, technologists from Silicon Valley and industry experts. We are on a mission to revolutionize the customer sales experience for businesses. We are innovating in speech AI, NLP and information retrieval systems to bring customers and businesses closer to one another. As one of the critical members of the Level team your work will be new and of the highest impact to shape the future of AI in businesses.