jack
KnowLogic: A Benchmark for Commonsense Reasoning via Knowledge-Driven Data Synthesis
Zhan, Weidong, Wang, Yue, Hu, Nan, Xiao, Liming, Ma, Jingyuan, Qin, Yuhang, Li, Zheng, Yang, Yixin, Deng, Sirui, Ding, Jinkun, Ma, Wenhan, Li, Rui, Luo, Weilin, Liu, Qun, Sui, Zhifang
Current evaluations of commonsense reasoning in LLMs are hindered by the scarcity of natural language corpora with structured annotations for reasoning tasks. To address this, we introduce KnowLogic, a benchmark generated through a knowledge-driven synthetic data strategy. KnowLogic integrates diverse commonsense knowledge, plausible scenarios, and various types of logical reasoning. One of the key advantages of KnowLogic is its adjustable difficulty levels, allowing for flexible control over question complexity. It also includes fine-grained labels for in-depth evaluation of LLMs' reasoning abilities across multiple dimensions. Our benchmark consists of 3,000 bilingual (Chinese and English) questions across various domains, and presents significant challenges for current LLMs, with the highest-performing model achieving only 69.57\%. Our analysis highlights common errors, such as misunderstandings of low-frequency commonsense, logical inconsistencies, and overthinking. This approach, along with our benchmark, provides a valuable tool for assessing and enhancing LLMs' commonsense reasoning capabilities and can be applied to a wide range of knowledge domains.
- North America > United States (0.14)
- Asia > Thailand (0.14)
- Leisure & Entertainment (0.93)
- Education > Educational Setting (0.68)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)
Jack the Ripper and the case of the missing DNA evidence
Feedback is New Scientist's popular sideways look at the latest science and technology news. You can submit items you believe may amuse readers to Feedback by emailing feedback@newscientist.com Feedback is as fond of true crime as the next morbidly curious ghoul, so we have occasionally dipped our toes into the never-ending well of speculation about the Whitechapel murders of 1888-91 and the near-mythical Jack the Ripper. Although frankly, we didn't get much further than Alan Moore and Eddie Campbell's From Hell, which (spoiler!) ties the killings to the British establishment and the Freemasons, who supposedly arranged the murders to create an evil psychic force that would perpetuate the patriarchy. But the field of "Ripperology" extends far beyond one eccentric graphic novel.
2023 could be the year for large language models - Jack Of All Techs
Check out all the on-demand sessions from the Intelligent Security Summit here. The launch of OpenAI's ChatGPT has the world abuzz about the advanced capabilities of artificial intelligence (AI). How will it transform industries? What does it mean for Google Search? These are just a small sampling of the questions many have been asking about the possibilities.
Willow's Jon Kasdan on How Its Recent Cameo Came to Be
Before the release of the Willow sequel series for Disney, showrunner Jon Kasdan had stated that the show would take some time in putting Val Kilmer's character of Madmartigan from the original film on screen. The show has taken steps to make sure that the character's presence is felt throughout the series, though, thanks to archive footage from the 1988 film, and also through the recent appearance of Christian Slater as Allagash, a close friend of the hero who crosses paths with the show's young heroes. This past week's episode, "Prisoners of Skellin," manages to bring back Kilmer's character in a different way. As Allagash and the teens look for a magical artifact inside the tomb of Wiggledoom, Kit (Ruby Cruz) hears her father Madmartigan's voice calling to her from somewhere inside the tomb. He doesn't make a physical appearance, as he's trapped in either Skellin (or another dimension that Skellin leads to), but it's looking as though getting him out of there will serve as part of the drive for the season's final two episodes.
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
Why this ChatGPT moment harks back to the original iPhone - Jack Of All Techs
Check out all the on-demand sessions from the Intelligent Security Summit here. Exactly three weeks ago, OpenAI released ChatGPT. Since then, it has been nearly impossible to keep up with both the hyped-up excitement and brow-furrowing concerns around use cases for the text-generating chatbot, ranging from the fun (writing limericks and rap lyrics) and the clever (writing prompts for text-to-image generators like DALL-E and Stable Diffusion) to the dangerous (threat actors using it for generating phishing emails) and the game-changing (could Google's entire search model [subscription required] be upended?). Is it possible to compare this moment in the evolution of generative AI to any other technology development? According to Forrester Research AI/ML analyst Rowan Curran, it is.
Want to be a data scientist in 2023? Here's what you need to know - Jack Of All Techs
"I felt there's a gap between what I learned in school, and what I actually do, and I also feel very insecure sometimes," she said. "I didn't know a lot of other data scientists who worked in the industry, so I wished I could have a community and talk to them." Essentially, said Liu, a data scientist takes something raw and translates it into something meaningful. The power of data science, she explained, is making sense of the past to make a recommendation for the future. "A data scientist is basically someone who solves a business problem with data," she explained.
Runaway (1984) - IMDb
Set in undetermined future society where robotics are a major part of everyday life, from performing household chores to construction and more, Runaway follows Officer Jack Ramsay (Tom Selleck), head of the so-called'Runaway Squad'. His hi-tech unit deals with out of control robots, intervening where humans may be endangered by the machines' malfunctions. When Ramsay discovers a plot by criminal genius Luther (Gene Simmons) to sell advanced microchips and cutting edge weaponry to the highest bidder, he finds himself taking on not only Luther, but also the dangerous killer's deadly robotic creations. Written and directed by the late Michael Crichton, Runaway continues the theme of the dangers of technology affecting the lives of humans that had been previously visited in his previous works Westworld and Looker. Reviewing Runaway now, thirty years after it's original release, is a real eye opener to Crichton's foresight.
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
Meet Flippy, Sippy and Chippy: These robots can cook fries, pour drinks and make tortilla chips
Whether it's creating perfectly cooked fries and burgers or pouring soda without any spills, robot chefs are venturing further into the $296 billion U.S. fast food industry amid a nationwide labor shortage. Miso Robotics, a California-based company, built a kitchen bot called Flippy that was able to cook 300 burgers per day and then expanded into whipping up fries with the second version. The fast-casual chain Wing Zone in May inked a deal with Miso to install Flippy 2 into all future restaurant locations. Jack in the Box is deploying that same machine along with the company's Sippy bot - which quickly pours, labels and seals beverage orders - this year with a goal of getting into 10 high-volume locations in 2023. Whether it's creating perfectly cooked fries and burgers or pouring soda without any spills, robot chefs are venturing further into the $296 billion U.S. fast food industry amid a nationwide labor shortage And Miso has another machine called Chippy that can cook up Chipotle's tortilla chips - which will be integrated into a southern California location of the Mexican restaurant this year.
- North America > United States > California > San Diego County > Vista (0.05)
- North America > United States > California > San Diego County > Chula Vista (0.05)
Why AIops may be necessary for the future of engineering - Jack Of All Techs
Were you unable to attend Transform 2022? Check out all of the summit sessions in our on-demand library now! Machine learning has crossed the chasm. In 2020, McKinsey found that out of 2,395 companies surveyed, 50% had an ongoing investment in machine learning. By 2030, machine learning is predicted to deliver around $13 trillion.
Customer and employee experience mistakes to avoid and how AI can help - Jack Of All Techs
Were you unable to attend Transform 2022? Check out all of the summit sessions in our on-demand library now! Enterprise leaders are constantly evaluating how technology can better serve the needs of their customers and employees. As AI technology progresses, businesses recognize the massive potential to improve customer and employee experiences and positively impact their bottom line. That's why more than half of leaders are investing accordingly, with plans to increase AI budgets in customer experience by at least 25% next year.