time travel
Time Travel: LLM-Assisted Semantic Behavior Localization with Git Bisect
We present a novel framework that integrates Large Language Models (LLMs) into the Git bisect process for semantic fault localization. Traditional bisect assumes deterministic predicates and binary failure states assumptions often violated in modern software development due to flaky tests, nonmonotonic regressions, and semantic divergence from upstream repositories. Our system augments bisect traversal with structured chain of thought reasoning, enabling commit by commit analysis under noisy conditions. We evaluate multiple open source and proprietary LLMs for their suitability and fine tune DeepSeekCoderV2 using QLoRA on a curated dataset of semantically labeled diffs. We adopt a weak supervision workflow to reduce annotation overhead, incorporating human in the loop corrections and self consistency filtering. Experiments across multiple open source projects show a 6.4 point absolute gain in success rate from 74.2 to 80.6 percent, leading to significantly fewer failed traversals and by experiment up to 2x reduction in average bisect time. We conclude with discussions on temporal reasoning, prompt design, and finetuning strategies tailored for commit level behavior analysis.
Our verdict on The Ministry of Time by Kaliane Bradley: A thumbs up
Kaliane Bradley's The Ministry of Time was (largely) a hit with the New Scientist Book Club One of the wonderful things about science fiction is the broadness of its church, and this was really brought home to me by our two most recent reads. The New Scientist Book Club moved from the hard science fiction spacefaring of Larry Niven's classic Ringworld in May to the near-future-set time travel of Kaliane Bradley's The Ministry of Time for our June read. The former takes its science seriously, diving into the maths and physics of its set-up with gusto; the latter – not so much. Culture editor Alison Flood rounds up the New Scientist Book Club's thoughts on our latest read, the science fiction classic Ringworld by Larry Niven The story of an unnamed civil servant who is given the job of supporting an "expat" from history – Commander Graham Gore, a (real) Victorian polar explorer from 1847 – The Ministry of Time is many things in one: a thriller, a romance, a piece of climate fiction (apparently), a science fiction novel about time. I couldn't put it down and loved all of it – apart, perhaps, from the ending.
US government announces it has achieved ability to 'manipulate space and time' with new technology
The Trump Administration quietly revealed it has futuristic technologies that literally bend time during a speech on'the golden age of American innovation.' The director of the White House Office of Science and Technology Policy, Michael Kratsios, declared that the US currently has the ability to'manipulate time and space' and'leave distance annihilated.' Kratsios made the bold statement on Monday during the Endless Frontiers Retreat, a scientific conference in Texas focused on promoting US technological innovations to maintain global competitiveness. The rest of the director's speech touched on American breakthroughs of the past and undoing Biden-era policies that the Trump Administration claims stifled innovation - adding that the regulatory process on new tech has been a burden since the 1970s. Kratsios actually referenced this again at the end of his speech, saying that Americans will soon have the choice to'craft new technologies and give themselves to scientific discoveries that will bend time and space.'
Scientists say time travel IS possible - and people have already done it
From H. G. Wells's The Time Machine to Christopher Nolan's Interstellar, the possibility of travelling through time has fascinated people for centuries. But, although it sounds like pure science fiction, physicists now believe that time travel really is possible. In fact, scientists say that people have already done it. But, before you start to plan your trip to ancient Rome, the experts caution that real time travel is nothing like what you see in the movies. It might seem obvious, but here on Earth, we all move through time at a speed of one second per second.
- North America > United States (0.17)
- North America > Canada > Ontario > National Capital Region > Ottawa (0.05)
- Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.05)
- Media > Film (0.55)
- Leisure & Entertainment (0.55)
- Government > Space Agency (0.50)
Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment
Guo, Yiju, Cui, Ganqu, Yuan, Lifan, Ding, Ning, Wang, Jiexin, Chen, Huimin, Sun, Bowen, Xie, Ruobing, Zhou, Jie, Lin, Yankai, Liu, Zhiyuan, Sun, Maosong
Alignment in artificial intelligence pursues the consistency between model responses and human preferences as well as values. In practice, the multifaceted nature of human preferences inadvertently introduces what is known as the "alignment tax" -a compromise where enhancements in alignment within one objective (e.g.,harmlessness) can diminish performance in others (e.g.,helpfulness). However, existing alignment techniques are mostly unidirectional, leading to suboptimal trade-offs and poor flexibility over various objectives. To navigate this challenge, we argue the prominence of grounding LLMs with evident preferences. We introduce controllable preference optimization (CPO), which explicitly specifies preference scores for different objectives, thereby guiding the model to generate responses that meet the requirements. Our experimental analysis reveals that the aligned models can provide responses that match various preferences among the "3H" (helpfulness, honesty, harmlessness) desiderata. Furthermore, by introducing diverse data and alignment goals, we surpass baseline methods in aligning with single objectives, hence mitigating the impact of the alignment tax and achieving Pareto improvements in multi-objective alignment.
- Asia > China > Beijing > Beijing (0.04)
- North America > United States (0.04)
- North America > Canada > Ontario > Toronto (0.04)
- Asia > Middle East > Jordan (0.04)
- Government (0.68)
- Information Technology (0.67)
Time Travel in LLMs: Tracing Data Contamination in Large Language Models
Golchin, Shahriar, Surdeanu, Mihai
Data contamination, i.e., the presence of test data from downstream tasks in the training data of large language models (LLMs), is a potential major issue in measuring LLMs' real effectiveness on other tasks. We propose a straightforward yet effective method for identifying data contamination within LLMs. At its core, our approach starts by identifying potential contamination at the instance level; using this information, our approach then assesses wider contamination at the partition level. To estimate contamination of individual instances, we employ "guided instruction:" a prompt consisting of the dataset name, partition type, and the random-length initial segment of a reference instance, asking the LLM to complete it. An instance is flagged as contaminated if the LLM's output either exactly or nearly matches the latter segment of the reference. To understand if an entire partition is contaminated, we propose two ideas. The first idea marks a dataset partition as contaminated if the average overlap score with the reference instances (as measured by ROUGE-L or BLEURT) is statistically significantly better with the completions from guided instruction compared to a "general instruction" that does not include the dataset and partition name. The second idea marks a dataset partition as contaminated if a classifier based on GPT-4 with few-shot in-context learning prompt marks multiple generated completions as exact/near-exact matches of the corresponding reference instances. Our best method achieves an accuracy between 92% and 100% in detecting if an LLM is contaminated with seven datasets, containing train and test/validation partitions, when contrasted with manual evaluation by human experts. Further, our findings indicate that GPT-4 is contaminated with AG News, WNLI, and XSum datasets.
Collective Mental Time Travel Can Influence the Future
We're often told to "be here now." Yet the mind is rarely tethered in place. We take mental trips to our past, revisiting what happened yesterday or when we were children, or we project into an imagined future: tomorrow's dinner date, the trajectory of our career at age 50. Rather than a diversion from the norm of mindful presence, this tendency to internally visit other time lines, called "mental time travel," is common; young adults, for example, think about their future an average of 59 times a day. Psychologists have suggested that this ability to time travel from the confines of our own heads is a fundamental aspect of what it means to be human.
The Best Sci-Fi TV Shows on Netflix
Netflix has an excellent international library, including German sci-fi gem Dark -- one of the best series on Netflix full stop. This adult animated anthology series spans a range of genres, with plenty of episodes hitting the Black Mirror comparison button. Robots in a post-apocalyptic city, farmers piloting mech suits and a space mission gone wrong all pop up in the first season. While the episodes can be hit and miss (some have been criticized for their treatment of women), you'll find plenty of thought-provoking and impressive animation. This apocalyptic sci-fi from Belgium will probably turn you off from flying any time soon.
- Europe > Belgium (0.25)
- North America > United States > Indiana (0.05)
- North America > Canada (0.05)
- (2 more...)
- Media > Television (1.00)
- Leisure & Entertainment (1.00)
'Eternal Threads' is a walking sim where time travel feel like a chore
"Eternal Threads" can be a fun puzzle to play with once you have all the pieces together. It encourages experimenting with different possible outcomes. With some combinations of choices, a character may survive the fire, but their life is irrevocably changed for the worse because of what they did or didn't do beforehand. But it takes a long time to get to that point. The full playtime is around 10 hours, and by the time you've absorbed enough context through cutscenes to understand how certain events impact the timeline, the game's in its final hours. It's a shame too, because the narrative's final act does take an interesting turn, but my patience was already exhausted by that point.
Time Travel in Video Games Isn't Really Cheating
Animal Crossing, like other games of its Nintendo Switch ilk, is meant to be grazed upon. It's designed for short bursts rather than binge sessions. There's only so much you can do in one day of play--hit your rocks, pick your fruit, check in at the shops to see what's new--then it's done. It's a nice reprieve from Assassin's Creed marathons for sure, but it can also start to drag if you play for too long. Once you're no longer enamored with the game, it just feels like a chore, and no one has time for that.