OptimalThinkingBench: Evaluating Over and Underthinking in LLMs
Aggarwal, Pranjal, Kim, Seungone, Lanchantin, Jack, Welleck, Sean, Weston, Jason, Kulikov, Ilia, Saha, Swarnadeep
Thinking LLMs solve complex tasks at the expense of increased compute and overthinking on simpler problems, while non-thinking LLMs are faster and cheaper but underthink on harder reasoning problems. This has led to the development of separate thinking and non-thinking LLM variants, leaving the onus of selecting the optimal model for each query on the end user. We introduce OptimalThinkingBench, a unified benchmark that jointly evaluates overthinking and underthinking in LLMs and also encourages the development of optimally-thinking models that balance performance and efficiency. Our benchmark comprises two sub-benchmarks: OverthinkingBench, featuring simple math and general queries in 72 domains, and UnderthinkingBench, containing 11 challenging reasoning tasks along with harder math problems. Using novel thinking-adjusted accuracy metrics, we extensively evaluate 33 different thinking and non-thinking models and show that no model is able to optimally think on our benchmark. Thinking models often overthink for hundreds of tokens on the simplest user queries without improving performance. In contrast, large non-thinking models underthink, often falling short of much smaller thinking models. We further explore several methods to encourage optimal thinking, but find that these approaches often improve on one sub-benchmark at the expense of the other, highlighting the need for better unified and optimal models in the future.
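The abstract's "thinking-adjusted accuracy" is not defined here, so the scoring rule below is purely an illustrative assumption: a toy metric that grants full credit for correct answers but discounts correct answers on simple queries whose thinking-token count exceeds a budget, capturing the overthinking penalty in spirit.

```python
# Hypothetical sketch of a thinking-adjusted accuracy score. The penalty
# form and the token budget are assumptions for illustration, not the
# benchmark's actual metric.

def thinking_adjusted_accuracy(results, token_budget=100):
    """results: list of (correct: bool, thinking_tokens: int, is_simple: bool)."""
    score = 0.0
    for correct, tokens, is_simple in results:
        if not correct:
            continue  # wrong answers earn nothing regardless of token use
        if is_simple and tokens > token_budget:
            # Overthinking: decay credit as token use exceeds the budget.
            score += token_budget / tokens
        else:
            score += 1.0
    return score / len(results)

runs = [
    (True, 40, True),     # simple query, concise and correct: full credit
    (True, 400, True),    # simple query, 400 thinking tokens: discounted
    (False, 900, False),  # hard query, wrong: no credit
    (True, 900, False),   # hard query, long reasoning, correct: full credit
]
print(thinking_adjusted_accuracy(runs))
```

Under this toy rule, a model that always emits long chains of thought loses credit exactly on the simple queries, which is the trade-off the benchmark is designed to surface.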
- Europe > Russia > Northwestern Federal District > Kaliningrad Oblast > Kaliningrad (0.04)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States (0.04)
- (9 more...)
- Leisure & Entertainment (1.00)
- Health & Medicine (1.00)
- Media > Music (0.94)
- Education (0.68)
RoLargeSum: A Large Dialect-Aware Romanian News Dataset for Summary, Headline, and Keyword Generation
Avram, Andrei-Marius, Timpuriu, Mircea, Iuga, Andreea, Matei, Vlad-Cristian, Tăiatu, Iulian-Marius, Găină, Tudor, Cercel, Dumitru-Clementin, Pop, Florin, Cercel, Mihaela-Claudia
Using supervised automatic summarization methods requires sufficient corpora that include pairs of documents and their summaries. As with many tasks in natural language processing, most of the datasets available for summarization are in English, posing challenges for developing summarization models in other languages. Thus, in this work, we introduce RoLargeSum, a novel large-scale summarization dataset for the Romanian language, crawled from various publicly available news websites in Romania and the Republic of Moldova and thoroughly cleaned to ensure a high-quality standard. RoLargeSum contains more than 615K news articles, together with their summaries, headlines, keywords, dialect, and other metadata found on the targeted websites. We further evaluated the performance of several BART variants and open-source large language models on RoLargeSum for benchmarking purposes. We manually evaluated the results of the best-performing system to gain insight into the potential pitfalls of this dataset and directions for future development.
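Summarization benchmarks of this kind are typically scored with ROUGE-style n-gram overlap. As a minimal sketch (not the paper's actual evaluation pipeline), ROUGE-1 F1 between a generated and a reference summary can be computed from unigram multiset overlap:

```python
from collections import Counter

def rouge1_f1(candidate: str, reference: str) -> float:
    """Unigram-overlap F1 between a generated and a reference summary.

    Counter intersection (&) takes the per-token minimum count, i.e. the
    multiset overlap used by ROUGE-1.
    """
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

print(rouge1_f1("the government announced new measures",
                "the government announced several new economic measures"))
```

Real evaluations use a full ROUGE implementation with stemming and multiple variants (ROUGE-2, ROUGE-L); this sketch only shows the core overlap computation.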
- Europe > Moldova (0.25)
- Europe > Romania > București - Ilfov Development Region > Municipality of Bucharest > Bucharest (0.04)
- Asia > Russia > Far Eastern Federal District > Sakha Republic (0.04)
- (2 more...)
North Korean troops in Ukraine 'fair game', US warns Russia as war rages on
United States defence secretary Lloyd Austin has waded in on reports that North Korea was preparing to enter the Ukraine war with troops. "If they are co-belligerents, if their intention is to participate in this war on Russia's behalf, that is a very, very serious issue," Austin said. Austin was returning from his fourth visit to Kyiv, where he announced a $400m package of US weapons for Ukraine. John Kirby, White House national security spokesman, said Washington believes that at least 3,000 North Korean soldiers arrived this month by sea to Vladivostok, Russia's largest Pacific port. "These soldiers then travelled onward to multiple Russian military training sites in eastern Russia, where they are currently undergoing training," Kirby said on Wednesday.
- Africa (0.30)
- Europe > Ukraine > Kyiv Oblast > Kyiv (0.26)
- Asia > Russia > Far Eastern Federal District > Primorsky Krai > Vladivostok (0.26)
- (11 more...)
- Government > Military (1.00)
- Government > Regional Government > Europe Government > Russia Government (0.93)
- Government > Regional Government > Asia Government > Russia Government (0.93)
Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data
Maekawa, Seiji, Iso, Hayate, Bhutani, Nikita
The rapid increase in textual information means we need more efficient methods to sift through, organize, and understand it all. While retrieval-augmented generation (RAG) models excel in accessing information from large document collections, they struggle with complex tasks that require aggregation and reasoning over information spanning across multiple documents--what we call holistic reasoning. Long-context language models (LCLMs) have great potential for managing large-scale documents, but their holistic reasoning capabilities remain unclear. In this work, we introduce HoloBench, a novel framework that brings database reasoning operations into text-based contexts, making it easier to systematically evaluate how LCLMs handle holistic reasoning across large documents. Our approach adjusts key factors such as context length, information density, distribution of information, and query complexity to evaluate LCLMs comprehensively. Our experiments show that the amount of information in the context has a bigger influence on LCLM performance than the actual context length. Furthermore, the complexity of queries affects performance more than the amount of information, particularly for different types of queries. Interestingly, queries that involve finding maximum or minimum values are easier for LCLMs and are less affected by context length, even though they pose challenges for RAG systems. However, tasks requiring the aggregation of multiple pieces of information show a noticeable drop in accuracy as context length increases. Additionally, we find that while grouping relevant information generally improves performance, the optimal positioning varies across models. Our findings surface both the advancements and the ongoing challenges in achieving a holistic understanding of long contexts.
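The core idea of bringing database operations into text can be sketched concretely: verbalize table rows into a long context (interleaved with filler to control information density), then score a model's answer against the ground truth computed directly on the table. The row template, filler, and task framing below are illustrative assumptions, not HoloBench's actual format.

```python
# Toy illustration of database-operations-over-text evaluation.

rows = [
    {"city": "Oslo", "population": 709_000},
    {"city": "Bergen", "population": 291_000},
    {"city": "Trondheim", "population": 212_000},
]

# Verbalize each row; the filler sentence stands in for unrelated text used
# to vary information density at a fixed context length.
filler = "This sentence is unrelated padding. "
context = filler.join(
    f"{r['city']} has a population of {r['population']}. " for r in rows
)

def ground_truth_max(table, key):
    """Answer a MAX-style query directly on the structured table."""
    return max(table, key=lambda r: r[key])["city"]

def score(model_answer: str, table, key) -> bool:
    """Exact-match scoring of a model's free-text answer."""
    return model_answer.strip().lower() == ground_truth_max(table, key).lower()

print(ground_truth_max(rows, "population"))
print(score("Oslo", rows, "population"))
```

Because the ground truth comes from the table rather than from annotations, context length, density, and query type can all be varied systematically while keeping the answer key exact, which is what makes this setup convenient for probing holistic reasoning.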
- North America > United States > California > Sonoma County (0.14)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- (11 more...)
- Transportation > Infrastructure & Services > Airport (1.00)
- Transportation > Air (1.00)
- Consumer Products & Services (0.93)
We're still in the steam-powered days of machine learning
The reveal of the ridiculous Cybertruck design last week made me curious about the history of cars. If you look at pictures of cars from the early days (as I, a Normal Person, did last Friday night), you'll see some insane ideas. Before we got to the Ford Model T that standardized car production, people iterated on a ton of crazy stuff. It took some time for people to experiment and agree on what a car even was, what features it had, and how it needed to work. For example, for a long time in the beginning, quite a few cars ran on steam, until gasoline began to overtake them (thanks in part to Henry Ford's standardization of the assembly line, which made non-gasoline cars harder to produce). Eventually, all the cars standardized to the form we know today: a closed car, powered by gasoline, with four wheels, four windows, seating 4-8 people. Even the godawful Cyberthing follows this model.
- Transportation > Ground > Road (0.71)
- Information Technology > Services (0.69)
- Automobiles & Trucks > Manufacturer (0.54)
Prediction of Porosity and Permeability Alteration based on Machine Learning Algorithms
Erofeev, Andrei, Orlov, Denis, Ryzhov, Alexey, Koroteev, Dmitry
The objective of this work is to study the applicability of various machine learning algorithms for predicting rock properties that geoscientists usually determine through special laboratory analysis. We demonstrate that these special properties can be predicted based only on routine core analysis (RCA) data. To validate the approach, core samples from a reservoir with soluble rock-matrix components (salts) were tested in more than 100 laboratory experiments. The challenge of the experiments was to characterize the salt content in the cores and the alteration of porosity and permeability after reservoir desalination due to drilling mud or water injection. For these three measured characteristics, we developed predictive models based on the results of RCA together with data on coring depth and the top and bottom depths of the productive horizons. To select the most accurate machine learning algorithm, a comparative analysis was performed. It was shown that different algorithms work better for different models; however, a neural network with two hidden layers demonstrated the best predictive ability and generalizability across all three rock characteristics jointly. Other algorithms, such as Support Vector Machines and Linear Regression, also worked well on the dataset, but only in particular cases. Overall, the applied approach allows predicting the alteration of porosity and permeability during desalination in porous rocks, and also evaluating salt concentration without direct laboratory measurements. This work also shows that the developed approaches could be applied to predict other rock properties (residual brine and oil saturations, relative permeability, capillary pressure, and others) whose laboratory measurements are time-consuming and expensive.
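The best-performing model reported above is a neural network with two hidden layers. As a minimal sketch of that architecture's forward pass only, the layer sizes, weights, and input features (normalized RCA-style values) below are made up for demonstration; a real model would be trained on the laboratory data described in the paper.

```python
# Illustrative forward pass of a two-hidden-layer regression network.
# All parameters are hand-set toy values, not a trained model.

def relu(v):
    return [max(0.0, x) for x in v]

def dense(x, weights, bias):
    """y_j = sum_i x_i * W[i][j] + b_j, with weights[i] the row for input i."""
    return [sum(xi * w[j] for xi, w in zip(x, weights)) + bias[j]
            for j in range(len(bias))]

def predict(features, params):
    h1 = relu(dense(features, *params[0]))  # hidden layer 1
    h2 = relu(dense(h1, *params[1]))        # hidden layer 2
    return dense(h2, *params[2])[0]         # single regression output

# Tiny hand-set parameters: 3 inputs -> 2 units -> 2 units -> 1 output.
params = [
    ([[0.5, -0.2], [0.1, 0.3], [-0.4, 0.2]], [0.0, 0.1]),
    ([[1.0, 0.5], [-0.5, 1.0]], [0.0, 0.0]),
    ([[0.8], [0.6]], [0.05]),
]

# features: e.g. normalized porosity, permeability, and coring depth from RCA
print(predict([0.2, 0.5, 0.1], params))
```

In practice one would use a library such as scikit-learn or PyTorch and fit the weights by gradient descent; the sketch only makes the "two hidden layers" architecture concrete.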
- Asia > Russia > Far Eastern Federal District > Sakha Republic (0.28)
- Europe > Russia (0.14)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.88)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.69)
The Last Invention of Man - Issue 53: Monsters
The Omega Team was the soul of the company. Whereas the rest of the enterprise brought in the money to keep things going, by various commercial applications of narrow AI, the Omega Team pushed ahead in their quest for what had always been the CEO's dream: building general artificial intelligence. Most other employees viewed "the Omegas," as they affectionately called them, as a bunch of pie-in-the-sky dreamers, perpetually decades away from their goal. They happily indulged them, however, because they liked the prestige that the cutting-edge work of the Omegas gave their company, and they also appreciated the improved algorithms that the Omegas occasionally gave them. What they didn't realize was that the Omegas had carefully crafted their image to hide a secret: They were extremely close to pulling off the most audacious plan in human history. Their charismatic CEO had handpicked them not only for being brilliant researchers, but also for ambition, idealism, and a strong commitment to helping humanity. He reminded them that their plan was extremely dangerous, and that if powerful governments found out, they would do virtually anything--including kidnapping--to shut them down or, preferably, to steal their code. But they were all in, 100 percent, for much the same reason that many of the world's top physicists joined the Manhattan Project to develop nuclear weapons: They were convinced that if they didn't do it first, someone less idealistic would. The AI they had built, nicknamed Prometheus, kept getting more capable. Although its cognitive abilities still lagged far behind those of humans in many areas, for example, social skills, the Omegas had pushed hard to make it extraordinary at one particular task: programming AI systems. 
They'd deliberately chosen this strategy because they had bought the intelligence explosion argument made by the British mathematician Irving Good back in 1965: "Let an ultraintelligent machine be defined as a machine that can far surpass all the intellectual activities of any man, however clever. Since the design of machines is one of these intellectual activities, an ultraintelligent machine could design even better machines; there would then unquestionably be an 'intelligence explosion,' and the intelligence of man would be left far behind. Thus the first ultraintelligent machine is the last invention that man need ever make, provided that the machine is docile enough to tell us how to keep it under control."
- North America > United States > California (0.04)
- Europe > Russia (0.04)
- Asia > South Korea (0.04)
- (3 more...)
- Media > News (1.00)
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- (5 more...)