AITopics | coco 1



NoFunEval: Funny How Code LMs Falter on Requirements Beyond Functional Correctness

Singhal, Manav, Aggarwal, Tushar, Awasthi, Abhijeet, Natarajan, Nagarajan, Kanade, Aditya

arXiv.org Artificial Intelligence, Feb-2-2024

Existing evaluation benchmarks of language models of code (code LMs) focus almost exclusively on whether the LMs can generate functionally correct code. In real-world software engineering, developers think beyond functional correctness. They have requirements on "how" a functionality should be implemented to meet overall system design objectives like efficiency, security, and maintainability. They would also trust the code LMs more if the LMs demonstrate robust understanding of requirements and code semantics. We propose a new benchmark NoFunEval to evaluate code LMs on non-functional requirements and simple classification instances for both functional and non-functional requirements. We propose a prompting method, Coding Concepts (CoCo), as a way for a developer to communicate the domain knowledge to the LMs. We conduct an extensive evaluation of twenty-two code LMs. Our finding is that they generally falter when tested on our benchmark, hinting at fundamental blind spots in their training setups. Surprisingly, even the classification accuracy on functional-correctness instances derived from the popular HumanEval benchmark is low, calling into question the depth of their comprehension and the source of their success in generating functionally correct code in the first place. We will release our benchmark and evaluation scripts publicly at https://aka.ms/NoFunEval.



2401.15963

Country:

Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Asia > India (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.98)


Remotely-Piloted Delivery Service Expands Its Capabilities

#artificialintelligence, Dec-10-2021, 20:00:32 GMT

Coco, the robot-based delivery service, announced the official launch of COCO 1, a larger, more advanced version of its signature pink bot. The COCO 1 is a first-of-its-kind delivery robot designed and manufactured in partnership with Segway, the largest micromobility hardware manufacturer. Over the next few months, Coco is deploying thousands of COCO 1 robots to serve local merchants in multiple cities. With its increased carrying capacity, the COCO 1 will deliver larger orders for a wider range of merchants, further reducing the need for car-based delivery. Compared to the current model, the COCO 1 adds a more efficient drivetrain and a larger battery, which together extend the delivery radius to up to three miles, nearly double that of the original model.



Country: North America > United States > California > Los Angeles County > Los Angeles (0.08)

Genre: Press Release (0.33)

Industry:

Transportation (0.87)
Energy > Energy Storage (0.57)
Electrical Industrial Apparatus (0.57)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.41)
