AITopics

2205.13532

Country: North America > Canada > Ontario > Toronto (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

#artificialintelligenceOct-11-2022, 19:40:55 GMT

How To Solve A Classification Task With Machine Learning

By now, I'm sure you've heard the term Machine Learning thrown a lot. Since most big companies and financial institutions rely on data to operate at such a large scale, it's no wonder that fields in data science are taking off. But what exactly is Machine Learning and how can we use it in a practical sense? Machine Learning is the field of study that gives computers the ability to learn without being explicitly programmed. The quote above is in my opinion the best general definition of what machine learning does.

applicant, dataset, machine learning, (13 more...)

#artificialintelligence

Industry: Banking & Finance > Loans (0.51)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.99)

Moon, Brady, Chatterjee, Satrajit, Scherer, Sebastian

TIGRIS: An Informed Sampling-based Algorithm for Informative Path Planning

Informative path planning is an important and challenging problem in robotics that remains to be solved in a manner that allows for wide-spread implementation and real-world practical adoption. Among various reasons for this, one is the lack of approaches that allow for informative path planning in high-dimensional spaces and non-trivial sensor constraints. In this work we present a sampling-based approach that allows us to tackle the challenges of large and high-dimensional search spaces. This is done by performing informed sampling in the high-dimensional continuous space and incorporating potential information gain along edges in the reward estimation. This method rapidly generates a global path that maximizes information gain for the given path budget constraints. We discuss the details of our implementation for an example use case of searching for multiple objects of interest in a large search space using a fixed-wing UAV with a forward-facing camera. We compare our approach to a sampling-based planner baseline and demonstrate how our contributions allow our approach to consistently out-perform the baseline by 18.0%. With this we thus present a practical and generalizable informative path planning framework that can be used for very large environments, limited budgets, and high dimensional search spaces, such as robots with motion constraints or high-dimensional configuration spaces.

artificial intelligence, machine learning, planning & scheduling, (13 more...)

doi: 10.1109/IROS47612.2022.9981992

2203.1283

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
Asia > Singapore > Central Region > Singapore (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

Wang, Zifeng, Sun, Jimeng

PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning

Accessing longitudinal multimodal Electronic Healthcare Records (EHRs) is challenging due to privacy concerns, which hinders the use of ML for healthcare applications. Synthetic EHRs generation bypasses the need to share sensitive real patient records. However, existing methods generate single-modal EHRs by unconditional generation or by longitudinal inference, which falls short of low flexibility and makes unrealistic EHRs. In this work, we propose to formulate EHRs generation as a text-to-text translation task by language models (LMs), which suffices to highly flexible event imputation during generation. We also design prompt learning to control the generation conditioned by numerical and categorical demographic features. We evaluate synthetic EHRs quality by two perplexity measures accounting for their longitudinal pattern (longitudinal imputation perplexity, lpl) and the connections cross modalities (cross-modality imputation perplexity, mpl). Moreover, we utilize two adversaries: membership and attribute inference attacks for privacy-preserving evaluation. Experiments on MIMIC-III data demonstrate the superiority of our methods on realistic EHRs generation (53.1\% decrease of lpl and 45.3\% decrease of mpl on average compared to the best baselines) with low privacy risks. Software is available at https://github.com/RyanWangZf/PromptEHR.

data mining, machine learning, natural language, (19 more...)

2211.01761

Country:

North America > United States > Illinois > Champaign County > Urbana (0.04)
Asia (0.04)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Technology > Medical Record (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Graph Neural Networks for Low-Energy Event Classification & Reconstruction in IceCube

Abbasi, R., Ackermann, M., Adams, J., Aggarwal, N., Aguilar, J. A., Ahlers, M., Ahrens, M., Alameddine, J. M., Alves, A. A. Jr., Amin, N. M., Andeen, K., Anderson, T., Anton, G., Argüelles, C., Ashida, Y., Athanasiadou, S., Axani, S., Bai, X., V., A. Balagopal, Baricevic, M., Barwick, S. W., Basu, V., Bay, R., Beatty, J. J., Becker, K. -H., Tjus, J. Becker, Beise, J., Bellenghi, C., Benda, S., BenZvi, S., Berley, D., Bernardini, E., Besson, D. Z., Binder, G., Bindig, D., Blaufuss, E., Blot, S., Bontempo, F., Book, J. Y., Borowka, J., Meneguolo, C. Boscolo, Böser, S., Botner, O., Böttcher, J., Bourbeau, E., Braun, J., Brinson, B., Brostean-Kaiser, J., Burley, R. T., Busse, R. S., Campana, M. A., Carnie-Bronca, E. G., Chen, C., Chen, Z., Chirkin, D., Choi, K., Clark, B. A., Classen, L., Coleman, A., Collin, G. H., Connolly, A., Conrad, J. M., Coppin, P., Correa, P., Countryman, S., Cowen, D. F., Cross, R., Dappen, C., Dave, P., De Clercq, C., DeLaunay, J. J., López, D. Delgado, Dembinski, H., Deoskar, K., Desai, A., Desiati, P., de Vries, K. D., de Wasseige, G., DeYoung, T., Diaz, A., Díaz-Vélez, J. C., Dittmer, M., Dujmovic, H., DuVernois, M. A., Ehrhardt, T., Eller, P., Engel, R., Erpenbeck, H., Evans, J., Evenson, P. A., Fan, K. L., Fazely, A. R., Fedynitch, A., Feigl, N., Fiedlschuster, S., Fienberg, A. T., Finley, C., Fischer, L., Fox, D., Franckowiak, A., Friedman, E., Fritz, A., Fürst, P., Gaisser, T. K., Gallagher, J., Ganster, E., Garcia, A., Garrappa, S., Gerhardt, L., Ghadimi, A., Glaser, C., Glauch, T., Glüsenkamp, T., Goehlke, N., Gonzalez, J. G., Goswami, S., Grant, D., Gray, S. J., Grégoire, T., Griswold, S., Günther, C., Gutjahr, P., Haack, C., Hallgren, A., Halliday, R., Halve, L., Halzen, F., Hamdaoui, H., Minh, M. Ha, Hanson, K., Hardin, J., Harnisch, A. A., Hatch, P., Haungs, A., Helbing, K., Hellrung, J., Henningsen, F., Heuermann, L., Hickford, S., Hill, C., Hill, G. C., Hoffman, K. D., Hoshina, K., Hou, W., Huber, T., Hultqvist, K., Hünnefeld, M., Hussain, R., Hymon, K., In, S., Iovine, N., Ishihara, A., Jansson, M., Japaridze, G. S., Jeong, M., Jin, M., Jones, B. J. P., Kang, D., Kang, W., Kang, X., Kappes, A., Kappesser, D., Kardum, L., Karg, T., Karl, M., Karle, A., Katz, U., Kauer, M., Kelley, J. L., Kheirandish, A., Kin, K., Kiryluk, J., Klein, S. R., Kochocki, A., Koirala, R., Kolanoski, H., Kontrimas, T., Köpke, L., Kopper, C., Koskinen, D. J., Koundal, P., Kovacevich, M., Kowalski, M., Kozynets, T., Krupczak, E., Kun, E., Kurahashi, N., Lad, N., Gualda, C. Lagunas, Larson, M. J., Lauber, F., Lazar, J. P., Lee, J. W., Leonard, K., Leszczyńska, A., Lincetto, M., Liu, Q. R., Liubarska, M., Lohfink, E., Love, C., Mariscal, C. J. Lozano, Lu, L., Lucarelli, F., Ludwig, A., Luszczak, W., Lyu, Y., Ma, W. Y., Madsen, J., Mahn, K. B. M., Makino, Y., Mancina, S., Sainte, W. Marie, Mariş, I. C., Marka, S., Marka, Z., Marsee, M., Martinez-Soler, I., Maruyama, R., McElroy, T., McNally, F., Mead, J. V., Meagher, K., Mechbal, S., Medina, A., Meier, M., Meighen-Berger, S., Merckx, Y., Micallef, J., Mockler, D., Montaruli, T., Moore, R. W., Morse, R., Moulai, M., Mukherjee, T., Naab, R., Nagai, R., Naumann, U., Nayerhoda, A., Necker, J., Neumann, M., Niederhausen, H., Nisa, M. U., Nowicki, S. C., Pollmann, A. Obertacke, Oehler, M., Oeyen, B., Olivas, A., Orsoe, R., Osborn, J., O'Sullivan, E., Pandya, H., Pankova, D. V., Park, N., Parker, G. K., Paudel, E. N., Paul, L., Heros, C. Pérez de los, Peters, L., Petersen, T. C., Peterson, J., Philippen, S., Pieper, S., Pizzuto, A., Plum, M., Popovych, Y., Porcelli, A., Rodriguez, M. Prado, Pries, B., Procter-Murphy, R., Przybylski, G. T., Raab, C., Rack-Helleis, J., Rameez, M., Rawlins, K., Rechav, Z., Rehman, A., Reichherzer, P., Renzi, G., Resconi, E., Reusch, S., Rhode, W., Richman, M., Riedel, B., Roberts, E. J., Robertson, S., Rodan, S., Roellinghoff, G., Rongen, M., Rott, C., Ruhe, T., Ruohan, L., Ryckbosch, D., Cantu, D. Rysewyk, Safa, I., Saffer, J., Salazar-Gallegos, D., Sampathkumar, P., Herrera, S. E. Sanchez, Sandrock, A., Santander, M., Sarkar, S., Sarkar, S., Schaufel, M., Schieler, H., Schindler, S., Schlueter, B., Schmidt, T., Schneider, J., Schröder, F. G., Schumacher, L., Schwefer, G., Sclafani, S., Seckel, D., Seunarine, S., Sharma, A., Shefali, S., Shimizu, N., Silva, M., Skrzypek, B., Smithers, B., Snihur, R., Soedingrekso, J., Søgaard, A., Soldin, D., Spannfellner, C., Spiczak, G. M., Spiering, C., Stamatikos, M., Stanev, T., Stein, R., Stezelberger, T., Stürwald, T., Stuttard, T., Sullivan, G. W., Taboada, I., Ter-Antonyan, S., Thompson, W. G., Thwaites, J., Tilav, S., Tollefson, K., Tönnis, C., Toscano, S., Tosi, D., Trettin, A., Tung, C. F., Turcotte, R., Twagirayezu, J. P., Ty, B., Elorrieta, M. A. Unland, Upshaw, K., Valtonen-Mattila, N., Vandenbroucke, J., van Eijndhoven, N., Vannerom, D., van Santen, J., Vara, J., Veitch-Michaelis, J., Verpoest, S., Veske, D., Walck, C., Wang, W., Watson, T. B., Weaver, C., Weigel, P., Weindl, A., Weldert, J., Wendt, C., Werthebach, J., Weyrauch, M., Whitehorn, N., Wiebusch, C. H., Willey, N., Williams, D. R., Wolf, M., Wrede, G., Wulff, J., Xu, X. W., Yanez, J. P., Yildizci, E., Yoshida, S., Yu, S., Yuan, T., Zhang, Z., Zhelnin, P.

IceCube, a cubic-kilometer array of optical sensors built to detect atmospheric and astrophysical neutrinos between 1 GeV and 1 PeV, is deployed 1.45 km to 2.45 km below the surface of the ice sheet at the South Pole. The classification and reconstruction of events from the in-ice detectors play a central role in the analysis of data from IceCube. Reconstructing and classifying events is a challenge due to the irregular detector geometry, inhomogeneous scattering and absorption of light in the ice and, below 100 GeV, the relatively low number of signal photons produced per event. To address this challenge, it is possible to represent IceCube events as point cloud graphs and use a Graph Neural Network (GNN) as the classification and reconstruction method. The GNN is capable of distinguishing neutrino events from cosmic-ray backgrounds, classifying different neutrino event types, and reconstructing the deposited energy, direction and interaction vertex. Based on simulation, we provide a comparison in the 1-100 GeV energy range to the current state-of-the-art maximum likelihood techniques used in current IceCube analyses, including the effects of known systematic uncertainties. For neutrino event classification, the GNN increases the signal efficiency by 18% at a fixed false positive rate (FPR), compared to current IceCube methods. Alternatively, the GNN offers a reduction of the FPR by over a factor 8 (to below half a percent) at a fixed signal efficiency. For the reconstruction of energy, direction, and interaction vertex, the resolution improves by an average of 13%-20% compared to current maximum likelihood techniques in the energy range of 1-30 GeV. The GNN, when run on a GPU, is capable of processing IceCube events at a rate nearly double of the median IceCube trigger rate of 2.7 kHz, which opens the possibility of using low energy neutrinos in online searches for transient events.

artificial intelligence, machine learning, university, (16 more...)

doi: 10.1088/1748-0221/17/11/P11003

2209.03042

Country:

North America > United States > Wisconsin > Dane County > Madison (0.14)
North America > United States > California > Alameda County > Berkeley (0.14)
Europe > Switzerland > Geneva > Geneva (0.14)
(50 more...)

Genre: Research Report (0.50)

Industry:

Government > Regional Government (0.93)
Energy (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.54)

Detecting Propagators of Disinformation on Twitter Using Quantitative Discursive Analysis

Bailey, Mark M.

Efforts by foreign actors to influence public opinion have gained considerable attention because of their potential to impact democratic elections. Thus, the ability to identify and counter sources of disinformation is increasingly becoming a top priority for government entities in order to protect the integrity of democratic processes. This study presents a method of identifying Russian disinformation bots on Twitter using centering resonance analysis and Clauset-Newman-Moore community detection. The data reflect a significant degree of discursive dissimilarity between known Russian disinformation bots and a control set of Twitter users during the timeframe of the 2016 U.S. Presidential Election. The data also demonstrate statistically significant classification capabilities (MCC = 0.9070) based on community clustering. The prediction algorithm is very effective at identifying true positives (bots), but is not able to resolve true negatives (non-bots) because of the lack of discursive similarity between control users. This leads to a highly sensitive means of identifying propagators of disinformation with a high degree of discursive similarity on Twitter, with implications for limiting the spread of disinformation that could impact democratic processes.

artificial intelligence, discursive similarity, machine learning, (17 more...)

2210.0576

Country:

South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)
North America > United States > Maryland > Montgomery County > Bethesda (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report (0.65)

Industry:

Government > Voting & Elections (1.00)
Government > Military > Cyberwarfare (0.96)
Government > Regional Government > Europe Government > Russia Government (0.86)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.73)

Laperrière, Gaëlle, Pelloin, Valentin, Rouvier, Mickaël, Stafylakis, Themos, Estève, Yannick

On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding

In this paper we examine the use of semantically-aligned speech representations for end-to-end spoken language understanding (SLU). We employ the recently-introduced SAMU-XLSR model, which is designed to generate a single embedding that captures the semantics at the utterance level, semantically aligned across different languages. This model combines the acoustic frame-level speech representation learning model (XLS-R) with the Language Agnostic BERT Sentence Embedding (LaBSE) model. We show that the use of the SAMU-XLSR model instead of the initial XLS-R model improves significantly the performance in the framework of end-to-end SLU. Finally, we present the benefits of using this model towards language portability in SLU.

artificial intelligence, machine learning, natural language, (19 more...)

2210.05291

Country:

Europe > France (0.04)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.04)
Europe > Russia > Northwestern Federal District > Leningrad Oblast > Saint Petersburg (0.04)
(3 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

EOCSA: Predicting Prognosis of Epithelial Ovarian Cancer with Whole Slide Histopathological Images

Liu, Tianling, Su, Ran, Sun, Changming, Li, Xiuting, Wei, Leyi

Ovarian cancer is one of the most serious cancers that threaten women around the world. Epithelial ovarian cancer (EOC), as the most commonly seen subtype of ovarian cancer, has rather high mortality rate and poor prognosis among various gynecological cancers. Survival analysis outcome is able to provide treatment advices to doctors. In recent years, with the development of medical imaging technology, survival prediction approaches based on pathological images have been proposed. In this study, we designed a deep framework named EOCSA which analyzes the prognosis of EOC patients based on pathological whole slide images (WSIs). Specifically, we first randomly extracted patches from WSIs and grouped them into multiple clusters. Next, we developed a survival prediction model, named DeepConvAttentionSurv (DCAS), which was able to extract patch-level features, removed less discriminative clusters and predicted the EOC survival precisely. Particularly, channel attention, spatial attention, and neuron attention mechanisms were used to improve the performance of feature extraction. Then patient-level features were generated from our weight calculation method and the survival time was finally estimated using LASSO-Cox model. The proposed EOCSA is efficient and effective in predicting prognosis of EOC and the DCAS ensures more informative and discriminative features can be extracted. As far as we know, our work is the first to analyze the survival of EOC based on WSIs and deep neural network technologies. The experimental results demonstrate that our proposed framework has achieved state-of-the-art performance of 0.980 C-index. The implementation of the approach can be found at https://github.com/RanSuLab/EOCprognosis.

data mining, machine learning, survival analysis, (18 more...)

doi: 10.1016/j.eswa.2022.117643

2210.05258

Country:

North America > United States (0.28)
Asia > Singapore (0.04)
Oceania > Australia (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology > Ovarian Cancer (1.00)
Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.96)

Knowledge-Driven New Drug Recommendation

Wu, Zhenbang, Yao, Huaxiu, Su, Zhe, Liebovitz, David M, Glass, Lucas M, Zou, James, Finn, Chelsea, Sun, Jimeng

Drug recommendation assists doctors in prescribing personalized medications to patients based on their health conditions. Existing drug recommendation solutions adopt the supervised multi-label classification setup and only work with existing drugs with sufficient prescription data from many patients. However, newly approved drugs do not have much historical prescription data and cannot leverage existing drug recommendation methods. To address this, we formulate the new drug recommendation as a few-shot learning problem. Yet, directly applying existing few-shot learning algorithms faces two challenges: (1) complex relations among diseases and drugs and (2) numerous false-negative patients who were eligible but did not yet use the new drugs. To tackle these challenges, we propose EDGE, which can quickly adapt to the recommendation for a new drug with limited prescription data from a few support patients. EDGE maintains a drug-dependent multi-phenotype few-shot learner to bridge the gap between existing and new drugs. Specifically, EDGE leverages the drug ontology to link new drugs to existing drugs with similar treatment effects and learns ontology-based drug representations. Such drug representations are used to customize the metric space of the phenotype-driven patient representations, which are composed of a set of phenotypes capturing complex patient health status. Lastly, EDGE eliminates the false-negative supervision signal using an external drug-disease knowledge base. We evaluate EDGE on two real-world datasets: the public EHR data (MIMIC-IV) and private industrial claims data. Results show that EDGE achieves 7.3% improvement on the ROC-AUC score over the best baseline.

artificial intelligence, machine learning, representation, (16 more...)

2210.05572

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.66)
Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.93)
Government > Regional Government > North America Government > United States Government > FDA (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.90)

#artificialintelligenceOct-10-2022, 13:26:20 GMT

F1 to F-beta

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. The F-1 score is a popular binary classification metric representing a balance between precision and recall. It is the Harmonic mean of precision and recall.

f-beta, precision and recall, sklearn, (9 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)