West Hollywood
HybridFC: A Hybrid Fact-Checking Approach for Knowledge Graphs
Qudus, Umair, Roeder, Michael, Saleem, Muhammad, Ngomo, Axel-Cyrille Ngonga
We consider fact-checking approaches that aim to predict the veracity of assertions in knowledge graphs. Five main categories of fact-checking approaches for knowledge graphs have been proposed in the recent literature, of which each is subject to partially overlapping limitations. In particular, current text-based approaches are limited by manual feature engineering. Path-based and rule-based approaches are limited by their exclusive use of knowledge graphs as background knowledge, and embedding-based approaches suffer from low accuracy scores on current fact-checking tasks. We propose a hybrid approach -- dubbed HybridFC -- that exploits the diversity of existing categories of fact-checking approaches within an ensemble learning setting to achieve a significantly better prediction performance. In particular, our approach outperforms the state of the art by 0.14 to 0.27 in terms of Area Under the Receiver Operating Characteristic curve on the FactBench dataset. Our code is open-source and can be found at https://github.com/dice-group/HybridFC.
Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models
Safavi-Naini, Seyed Amir Ahmad, Ali, Shuhaib, Shahab, Omer, Shahhoseini, Zahra, Savage, Thomas, Rafiee, Sara, Samaan, Jamil S, Shabeeb, Reem Al, Ladak, Farah, Yang, Jamie O, Echavarria, Juan, Babar, Sumbal, Shaukat, Aasma, Margolis, Samuel, Tatonetti, Nicholas P, Nadkarni, Girish, Kurdi, Bara El, Soroush, Ali
Background and Aims: This study evaluates the medical reasoning performance of large language models (LLMs) and vision language models (VLMs) in gastroenterology. Methods: We used 300 gastroenterology board exam-style multiple-choice questions, 138 of which contain images to systematically assess the impact of model configurations and parameters and prompt engineering strategies utilizing GPT-3.5. Next, we assessed the performance of proprietary and open-source LLMs (versions), including GPT (3.5, 4, 4o, 4omini), Claude (3, 3.5), Gemini (1.0), Mistral, Llama (2, 3, 3.1), Mixtral, and Phi (3), across different interfaces (web and API), computing environments (cloud and local), and model precisions (with and without quantization). Finally, we assessed accuracy using a semiautomated pipeline. Results: Among the proprietary models, GPT-4o (73.7%) and Claude3.5-Sonnet (74.0%) achieved the highest accuracy, outperforming the top open-source models: Llama3.1-405b (64%), Llama3.1-70b (58.3%), and Mixtral-8x7b (54.3%). Among the quantized open-source models, the 6-bit quantized Phi3-14b (48.7%) performed best. The scores of the quantized models were comparable to those of the full-precision models Llama2-7b, Llama2--13b, and Gemma2-9b. Notably, VLM performance on image-containing questions did not improve when the images were provided and worsened when LLM-generated captions were provided. In contrast, a 10% increase in accuracy was observed when images were accompanied by human-crafted image descriptions. Conclusion: In conclusion, while LLMs exhibit robust zero-shot performance in medical reasoning, the integration of visual data remains a challenge for VLMs. Effective deployment involves carefully determining optimal model configurations, encouraging users to consider either the high performance of proprietary models or the flexible adaptability of open-source models.
Reddit-Impacts: A Named Entity Recognition Dataset for Analyzing Clinical and Social Effects of Substance Use Derived from Social Media
Ge, Yao, Das, Sudeshna, O'Connor, Karen, Al-Garadi, Mohammed Ali, Gonzalez-Hernandez, Graciela, Sarker, Abeed
Substance use disorders (SUDs) are a growing concern globally, necessitating enhanced understanding of the problem and its trends through data-driven research. Social media are unique and important sources of information about SUDs, particularly since the data in such sources are often generated by people with lived experiences. In this paper, we introduce Reddit-Impacts, a challenging Named Entity Recognition (NER) dataset curated from subreddits dedicated to discussions on prescription and illicit opioids, as well as medications for opioid use disorder. The dataset specifically concentrates on the lesser-studied, yet critically important, aspects of substance use--its clinical and social impacts. We collected data from chosen subreddits using the publicly available Application Programming Interface for Reddit. We manually annotated text spans representing clinical and social impacts reported by people who also reported personal nonmedical use of substances including but not limited to opioids, stimulants and benzodiazepines. Our objective is to create a resource that can enable the development of systems that can automatically detect clinical and social impacts of substance use from text-based social media data. The successful development of such systems may enable us to better understand how nonmedical use of substances affects individual health and societal dynamics, aiding the development of effective public health strategies. In addition to creating the annotated data set, we applied several machine learning models to establish baseline performances. Specifically, we experimented with transformer models like BERT, and RoBERTa, one few-shot learning model DANN by leveraging the full training dataset, and GPT-3.5 by using one-shot learning, for automatic NER of clinical and social impacts. The dataset has been made available through the 2024 SMM4H shared tasks.
Waymo Will Bring Autonomous Taxis to Los Angeles--Its Biggest Challenge Yet
Paid autonomous vehicle service is coming to Los Angeles, thanks to a decision by California regulators today to allow Alphabet subsidiary Waymo to operate in the city. Under the new ruling, Waymo is also permitted to launch service in a large section of the San Francisco Peninsula. The decision by the California Public Utilities Commission will likely prove controversial. It comes over the protest of local governments and agencies, including the Los Angeles Department of Transportation, the San Francisco County Transportation Authority, the city of South San Francisco, and the County of San Mateo. All argued that local government and citizens should have more input and oversight over the expanded autonomous taxi service.
San Mateo County is the latest community expressing concern against Waymo, driverless cars
Another California community is raising concerns about plans to unleash the Waymo self-driving vehicle in its jurisdiction, following several incidents involving autonomous ride-hailing cars that resulted in injuries. San Mateo County, in the San Francisco Bay Area, has requested more information from state regulators before allowing Google-owned Waymo to operate its driverless vehicles in the county. San Mateo County made the request after Waymo submitted a letter Jan. 19 to the California Public Utilities Commission, asking the agency to approve its proposed expansion of its Automated Vehicle Passenger Services into portions of the San Francisco Peninsula, which includes San Mateo County, as well as the southwest region of Los Angeles County. The company has already been serving a portion of San Francisco, from Lands End to Bernal Heights. The autonomous car began offering rides for a limited time in November in Santa Monica, Century City, West Hollywood, Mid-City Koreatwon and downtown L.A., giving residents a chance at testing the driverless ride.
Mayor Bass pushes for more testing before permitting robotaxis in Los Angeles
As Waymo robotaxis plucked up passengers for free this week in Santa Monica and Venice, worry grew among Los Angeles officials about the safety of driverless cars on city streets. Mayor Bass asked regulators Wednesday to increase their scrutiny of automated taxis and said the city should have a say in how they are regulated. The move comes after a Cruise robotaxi dragged a person down a San Francisco street last month and the company allegedly failed to disclose the footage to the state Department of Motor Vehicles. The DMV suspended the General Motors-owned company's permits and Cruise has since announced it will suspend U.S. operations. The incident in San Francisco -- where the two driverless fleets were doing business -- was among several that raised red flags among Los Angeles officials, who have begun to see more and more robotaxis being tested on city streets.
America Is About to See Way More Driverless Cars
The future of driverless cars in America is a promotional booth with a surfboard and a movie director's clapboard. Robotaxis have officially arrived in Los Angeles, and last week, residents lined up in Santa Monica's main promenade to get a smartphone code needed to ride them. For now, the cars, from the Alphabet-owned start-up Waymo, won't leave the tame streets of Santa Monica. But in the coming months, they'll embark on a multi-month "tour" of the city, heading to West Hollywood, downtown L.A., and several other neighborhoods. For the past decade, the two leading robotaxi companies, Waymo and Cruise, have been focused primarily on San Francisco and Phoenix, where they both already take paid passengers.
The 40 Greatest Stand-Alone TV Episodes of All Time
Whether we're living in the age of Peak TV or Trough TV, one thing is clear: There's too much TV. Thankfully, not every show has to be watched in its entirety. One of the best things about television is its serialized nature, the continuous thread that strings viewers along from one episode to the next. It's a cliché that prestige television is the new novel precisely because of the way that many dramas develop their characters and plots over many hours of storytelling. But an older virtue of TV is its brevity--the way a scenario can be introduced and resolved within the space of an hour, or half that--and some of the best episodes are less like chapters in a long-running novel than like short stories or short films. There's been no shortage of debate about this question, but for our purposes, we're defining it simply as an episode that stands up on its own, whether or not you've seen the rest of the show. Some are "bottle episodes," which typically confine a small cast to one location to save money. Some are "departure episodes," in which a show abandons its usual format or style to suddenly become, say, silent, animated, a musical, or about a minor character it was never about before. But not all bottle episodes and departure episodes are stand-alones, and vice versa. It's for this reason that you won't find Breaking Bad's celebrated "Fly" on this list: It may be a bottle episode, but it doesn't stand alone, because the best thing about it--how the housefly is a metaphor for everything else going on in the series--is comprehensible only to those who have watched the show. These are English-language selections, and, out of fairness, we have limited ourselves to one episode per series, although some shows are full of stellar contenders. Use these picks--arranged in chronological order, with an admitted bias toward our most recent, and best, era of television--to populate your streaming queue with a feast of bite-sized morsels, each of which could double as either a snackable introduction to a new show or a satisfying meal in itself. If movies made Alfred Hitchcock a name, TV made him a brand. The master of suspense embraced the burgeoning medium in 1955 with Alfred Hitchcock Presents (later renamed The Alfred Hitchcock Hour), an anthology series whose entries began and ended the same way: the titular celebrity providing context to a unique half-hour thriller, typically an adaption of a short story by an esteemed author (John Cheever, Ray Bradbury, many others).
When Workplace Surveillance Goes Terribly Wrong
This story is part of Future Tense Fiction, a monthly series of short stories from Future Tense and Arizona State University's Center for Science and the Imagination about how technology and science will change our lives. Amanda sat at her desk, picking at the same $30 Little Gem salad she ordered daily, suffering a small burning sensation in her gut that was triggered either by acid reflux or the dying embers of her rapidly expiring conscience. Of course, it was standard procedure for her husband to demand that the security firm Dark Metal surveil potential new hires for any of his multibillion-dollar companies, but this was the first time Amanda had been involved in contracting the private intelligence agency herself. Seedlings is your venture, Reid had promised her, even though he'd named himself CEO. I want you to take the lead on this. Amanda was COO of Seedlings and reported to her husband, who dismissed Amanda's concerns about the legal ramifications of their actions. Worrying about the law was something poor people did, Reid insisted. Besides, she'd never seen Reid do anything that nefarious with this type of information. But Maggie Everett was the type of candidate that pleased Reid. Amanda had done her job, which was to find Maggie, and the people at Dark Metal had done theirs, which was to surveil her and create a comprehensive biographical profile. This seemed like overkill to Amanda. Maggie wasn't in the running to become a high-profile executive at one of Reid's billion-dollar firms. She was being interviewed to work at a preschool. Certainly, Seedlings differed from other private preschools--there was the possibility Maggie would be exposed to confidential information. But this was what NDAs were for. Unleashing a network of spies upon a poor teacher who would ultimately be responsible for 10 toddlers seemed like an absurd waste of resources. And this was just Phase 1. Phase 2 would have to wait until after Maggie was hired, of course. Amanda reopened Dark Metal's inch-thick dossier. The logline: Maggie was smart but stupid. Smart: She'd majored in English at Yale, then received an MFA in creative writing from Brown, and finally a master's in early childhood education from Columbia. Stupid: She'd accumulated $103,345 in student debt, which she'd never pay off unless she took a job somewhere like Seedlings.
Is AI the future of Hollywood? How the hype squares with reality
For every problem you can think of, someone is out there pitching a solution that involves artificial intelligence. AI could help solve such intractable problems as climate change and dangerous work conditions, the technology's most eager boosters promise. It could even fix the much-maligned "Game of Thrones" finale, if you believe one of the industry's most powerful proponents and a featured speaker at this month's South by Southwest conference. "Imagine if you could ask your AI to make a new ending that goes a different way," said Greg Brockman, president and co-founder of OpenAI, the research group behind the conversation software ChatGPT and the image-generation module DALL-E. "Maybe even put yourself in there as a main character or something, having interactive experiences."