Goto

Collaborating Authors

 environmental science


A Survey of Foundation Models for Environmental Science

Yu, Runlong, Chen, Shengyu, Xie, Yiqun, Jia, Xiaowei

arXiv.org Artificial Intelligence

Modeling environmental ecosystems is essential for effective resource management, sustainable development, and understanding complex ecological processes. However, traditional methods frequently struggle with the inherent complexity, interconnectedness, and limited data of such systems. Foundation models, with their large-scale pre-training and universal representations, offer transformative opportunities by integrating diverse data sources, capturing spatiotemporal dependencies, and adapting to a broad range of tasks. This survey presents a comprehensive overview of foundation model applications in environmental science, highlighting advancements in forward prediction, data generation, data assimilation, downscaling, model ensembling, and decision-making across domains. We also detail the development process of these models, covering data collection, architecture design, training, tuning, and evaluation. By showcasing these emerging methods, we aim to foster interdisciplinary collaboration and advance the integration of cutting-edge machine learning for sustainable solutions in environmental science.


Unveiling the Role of Artificial Intelligence and Stock Market Growth in Achieving Carbon Neutrality in the United States: An ARDL Model Analysis

Rafi, Azizul Hakim, Chowdhury, Abdullah Al Abrar, Sultana, Adita, Noman, Abdulla All

arXiv.org Artificial Intelligence

Given the fact that climate change has become one of the most pressing problems in many countries in recent years, specialized research on how to mitigate climate change has been adopted by many countries. Within this discussion, the influence of advanced technologies in achieving carbon neutrality has been discussed. While several studies investigated how AI and Digital innovations could be used to reduce the environmental footprint, the actual influence of AI in reducing CO2 emissions (a proxy measuring carbon footprint) has yet to be investigated. This paper studies the role of advanced technologies in general, and Artificial Intelligence (AI) and ICT use in particular, in advancing carbon neutrality in the United States, between 2021. Secondly, this paper examines how Stock Market Growth, ICT use, Gross Domestic Product (GDP), and Population affect CO2 emissions using the STIRPAT model. After examining stationarity among the variables using a variety of unit root tests, this study concluded that there are no unit root problems across all the variables, with a mixed order of integration. The ARDL bounds test for cointegration revealed that variables in this study have a long-run relationship. Moreover, the estimates revealed from the ARDL model in the short- and long-run indicated that economic growth, stock market capitalization, and population significantly contributed to the carbon emissions in both the short-run and long-run. Conversely, AI and ICT use significantly reduced carbon emissions over both periods. Furthermore, findings were confirmed to be robust using FMOLS, DOLS, and CCR estimations. Furthermore, diagnostic tests indicated the absence of serial correlation, heteroscedasticity, and specification errors and, thus, the model was robust.


Morphological Detection and Classification of Microplastics and Nanoplastics Emerged from Consumer Products by Deep Learning

Rezvani, Hadi, Zarrabi, Navid, Mehta, Ishaan, Kolios, Christopher, Jaafar, Hussein Ali, Kao, Cheng-Hao, Saeedi, Sajad, Yousefi, Nariman

arXiv.org Artificial Intelligence

For example, a U-Net [31] model can be used for some studies have utilized manually annotated images for deep semantic segmentation, and a Convolutional Neural Network learning applications involving microplastics, their datasets are (CNN) can then classify the segmented pixels, as demonstrated not publicly accessible [22], [23], [25]. Notably, there is only in [22], [24]. It is also possible to perform instance segmentation one other open-source Scanning Electron Microscopy (SEM) directly from the start. For instance, a Mask R-CNN dataset on microplastics, presented in [24], which categorizes model can simultaneously identify regions of interest, classify particles by shape (e.g., fragments, fibers, and beads) and each detected object, and generate a mask for each instance, features a more limited size distribution. These contributions as shown by [23]. Additionally, Faster R-CNN, primarily used not only address the urgent environmental issue of microplastic for object detection, has been applied to microscopic images to contamination but also set a new benchmark for detecting and classify microplastics into two polymer types [25]. Given the analyzing microplastics in aquatic environments, paving the nature of our dataset, where overlapping and crowded MNPs way for future innovations in the field.


EnviroExam: Benchmarking Environmental Science Knowledge of Large Language Models

Huang, Yu, Guo, Liang, Guo, Wanqian, Tao, Zhe, Lv, Yang, Sun, Zhihao, Zhao, Dongfang

arXiv.org Artificial Intelligence

In the field of environmental science, it is crucial to have robust evaluation metrics for large language models to ensure their efficacy and accuracy. We propose EnviroExam, a comprehensive evaluation method designed to assess the knowledge of large language models in the field of environmental science. EnviroExam is based on the curricula of top international universities, covering undergraduate, master's, and doctoral courses, and includes 936 questions across 42 core courses. By conducting 0-shot and 5-shot tests on 31 open-source large language models, EnviroExam reveals the performance differences among these models in the domain of environmental science and provides detailed evaluation standards. The results show that 61.3% of the models passed the 5-shot tests, while 48.39% passed the 0-shot tests. By introducing the coefficient of variation as an indicator, we evaluate the performance of mainstream open-source large language models in environmental science from multiple perspectives, providing effective criteria for selecting and fine-tuning language models in this field. Future research will involve constructing more domain-specific test sets using specialized environmental science textbooks to further enhance the accuracy and specificity of the evaluation.


ChatGPT cites the most-cited articles and journals, relying solely on Google Scholar's citation counts. As a result, AI may amplify the Matthew Effect in environmental science

Petiska, Eduard

arXiv.org Artificial Intelligence

ChatGPT (GPT) has become one of the most talked-about innovations in recent years, with over 100 million users worldwide. However, there is still limited knowledge about the sources of information GPT utilizes. As a result, we carried out a study focusing on the sources of information within the field of environmental science. In our study, we asked GPT to identify the ten most significant subdisciplines within the field of environmental science. We then asked it to compose a scientific review article on each subdiscipline, including 25 references. We proceeded to analyze these references, focusing on factors such as the number of citations, publication date, and the journal in which the work was published. Our findings indicate that GPT tends to cite highly-cited publications in environmental science, with a median citation count of 1184.5. It also exhibits a preference for older publications, with a median publication year of 2010, and predominantly refers to well-respected journals in the field, with Nature being the most cited journal by GPT. Interestingly, our findings suggest that GPT seems to exclusively rely on citation count data from Google Scholar for the works it cites, rather than utilizing citation information from other scientific databases such as Web of Science or Scopus. In conclusion, our study suggests that Google Scholar citations play a significant role as a predictor for mentioning a study in GPT-generated content. This finding reinforces the dominance of Google Scholar among scientific databases and perpetuates the Matthew Effect in science, where the rich get richer in terms of citations. With many scholars already utilizing GPT for literature review purposes, we can anticipate further disparities and an expanding gap between lesser-cited and highly-cited publications.


Machine Guided Discovery of Novel Carbon Capture Solvents

McDonagh, James L., Wunsch, Benjamin H., Zavitsanou, Stamatia, Harrison, Alexander, Elmegreen, Bruce, Gifford, Stacey, van Kessel, Theodore, Cipcigan, Flaviu

arXiv.org Artificial Intelligence

The increasing importance of carbon capture technologies for deployment in remediating CO2 emissions, and thus the necessity to improve capture materials to allow scalability and efficiency, faces the challenge of materials development, which can require substantial costs and time. Machine learning offers a promising method for reducing the time and resource burdens of materials development through efficient correlation of structure-property relationships to allow down-selection and focusing on promising candidates. Towards demonstrating this, we have developed an end-to-end "discovery cycle" to select new aqueous amines compatible with the commercially viable acid gas scrubbing carbon capture. We combine a simple, rapid laboratory assay for CO2 absorption with a machine learning based molecular fingerprinting model approach. The prediction process shows 60% accuracy against experiment for both material parameters and 80% for a single parameter on an external test set. The discovery cycle determined several promising amines that were verified experimentally, and which had not been applied to carbon capture previously. In the process we have compiled a large, single-source data set for carbon capture amines and produced an open source machine learning tool for the identification of amine molecule candidates (https://github.com/IBM/Carbon-capture-fingerprint-generation).


AI in Fraud Detection

#artificialintelligence

Artificial intelligence (AI) has the potential to play a significant role in the field of environmental science, particularly in relation to monitoring and mitigating climate change. One way AI can be used in environmental science is through the analysis of large amounts of data. For example, AI algorithms can be trained to process satellite imagery to detect changes in land use, such as deforestation or urban expansion. Similarly, AI can be used to analyze data from weather stations and climate models to better understand and predict the effects of climate change. Another way AI can be used in environmental science is to optimize decision-making related to climate change mitigation.


A Primer on Topological Data Analysis to Support Image Analysis Tasks in Environmental Science

#artificialintelligence

Topological data analysis (TDA) is a tool from data science and mathematics that is beginning to make waves in environmental science. In this work, we seek to provide an intuitive and understandable introduction to a tool from TDA that is particularly useful for the analysis of imagery, namely persistent homology. We briefly discuss the theoretical background but focus primarily on understanding the output of this tool and discussing what information it can glean. To this end, we frame our discussion around a guiding example of classifying satellite images from the Sugar, Fish, Flower, and Gravel Dataset produced for the study of mesocale organization of clouds by Rasp et. We demonstrate how persistent homology and its vectorization, persistence landscapes, can be used in a workflow with a simple machine learning algorithm to obtain good results, and explore in detail how we can explain this behavior in terms of image-level features.


A call for ethical use of AI in Earth system science

#artificialintelligence

Artificial intelligence holds vast potential to help solve a number of challenging problems in Earth system science, from improving prediction of severe weather events to increasing the efficiency of climate models. But as in all AI applications, the use of machine learning and other techniques in environmental science has the potential to introduce biases that could deepen inequities. The authors of a new paper published in the journal Environmental Data Science argue that researchers must develop ethical, responsible, and trustworthy approaches to applying AI in Earth system science to ensure that unintentional consequences do not worsen environmental and climate injustice. "It's really exciting to see all the ways researchers are finding to creatively apply artificial intelligence in weather, climate, and other environmental science research," said David John Gagne, a scientist at the National Center for Atmospheric Research (NCAR) and a paper co-author. "But we have a responsibility to ensure that we don't cause more harm than good."


The Need for Ethical, Responsible, and Trustworthy Artificial Intelligence for Environmental Sciences

McGovern, Amy, Ebert-Uphoff, Imme, Gagne, David John II, Bostrom, Ann

arXiv.org Artificial Intelligence

Given the growing use of Artificial Intelligence (AI) and machine learning (ML) methods across all aspects of environmental sciences, it is imperative that we initiate a discussion about the ethical and responsible use of AI. In fact, much can be learned from other domains where AI was introduced, often with the best of intentions, yet often led to unintended societal consequences, such as hard coding racial bias in the criminal justice system or increasing economic inequality through the financial system. A common misconception is that the environmental sciences are immune to such unintended consequences when AI is being used, as most data come from observations, and AI algorithms are based on mathematical formulas, which are often seen as objective. In this article, we argue the opposite can be the case. Using specific examples, we demonstrate many ways in which the use of AI can introduce similar consequences in the environmental sciences. This article will stimulate discussion and research efforts in this direction. As a community, we should avoid repeating any foreseeable mistakes made in other domains through the introduction of AI. In fact, with proper precautions, AI can be a great tool to help {\it reduce} climate and environmental injustice. We primarily focus on weather and climate examples but the conclusions apply broadly across the environmental sciences.