aaaa
Multilingual corpora for the study of new concepts in the social sciences and humanities:
Kyriakoglou, Revekka, Pappa, Anna
This article presents a hybrid methodology for building a multilingual corpus designed to support the study of emerging concepts in the humanities and social sciences (HSS), illustrated here through the case of ``non-technological innovation''. The corpus relies on two complementary sources: (1) textual content automatically extracted from company websites, cleaned for French and English, and (2) annual reports collected and automatically filtered according to documentary criteria (year, format, duplication). The processing pipeline includes automatic language detection, filtering of non-relevant content, extraction of relevant segments, and enrichment with structural metadata. From this initial corpus, a derived dataset in English is created for machine learning purposes. For each occurrence of a term from the expert lexicon, a contextual block of five sentences is extracted (two preceding and two following the sentence containing the term). Each occurrence is annotated with the thematic category associated with the term, enabling the construction of data suitable for supervised classification tasks. This approach results in a reproducible and extensible resource, suitable both for analyzing lexical variability around emerging concepts and for generating datasets dedicated to natural language processing applications.
- North America > United States > Maine (0.04)
- Europe > Middle East > Malta > Port Region > Southern Harbour District > Valletta (0.04)
- Europe > Bulgaria > Sofia City Province > Sofia (0.04)
- Asia > South Korea (0.04)
Searching for Carriers of the Diffuse Interstellar Bands Across Disciplines, using Natural Language Processing
d'Obrenan, Corentin van den Broek, Galliano, Frédéric, Minton, Jeremy, Botev, Viktor, Wu, Ronin
This is even more dramatic for interdisciplinary studies, where several fields need to be explored. A tool to help researchers overcome this is Natural Language Processing (NLP): a machine-learning (ML) technique that allows scientists to automatically synthesize information from many articles. As a practical example, we have used NLP to conduct an interdisciplinary search for compounds that could be carriers for Diffuse Interstellar Bands (DIBs), a long-standing open question in astrophysics. We have trained a NLP model on a corpus of 1.5 million cross-domain articles in open access, and fine-tuned this model with a corpus of astrophysical publications about DIBs. Our analysis points us toward several molecules, studied primarily in biology, having transitions at the wavelengths of several DIBs and composed of abundant interstellar atoms. Several of these molecules contain chromophores, small molecular groups responsible for the molecule's colour, that could be promising candidate carriers. Identifying viable carriers demonstrates the value of using NLP to tackle open scientific questions, in an interdisciplinary manner.
Routine Design for Mechanical Engineering
The system described in this article is currently working in the field at the Sales Department of EKATO, one of the world's most successful manufacturers of industrial mixing machines. It was developed in close cooperation with the Fraunhofer Institute for Information and Data Processing (IITB) during a three-year period. Industrial mixing machines, better known as agitators, are widely used in industrial manufacturing. They are especially useful for the chemical and pharmaceutical industries, food production, and biotechnology. The basic structure of an industrial agitator is shown in figure 1.
LTD Facilitates Learning in a Noisy Environment
Munro, Paul W., Hernández, Gerardina
This increase in synaptic strength must be countered by a mechanism for weakening the synapse [4]. The biological correlate, long-term depression (LTD) has also been observed in the laboratory; that is, synapses are observed to weaken when low presynaptic activity coincides with high postsynaptic activity [5]-[6].
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.05)
LTD Facilitates Learning in a Noisy Environment
Munro, Paul W., Hernández, Gerardina
This increase in synaptic strength must be countered by a mechanism for weakening the synapse [4]. The biological correlate, long-term depression (LTD) has also been observed in the laboratory; that is, synapses are observed to weaken when low presynaptic activity coincides with high postsynaptic activity [5]-[6].
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.05)
LTD Facilitates Learning in a Noisy Environment
Munro, Paul W., Hernández, Gerardina
This increase in synaptic strength must be countered by a mechanism for weakening the synapse [4]. The biological correlate, long-term depression (LTD) has also been observed in the laboratory; that is, synapses are observed to weaken when low presynaptic activity coincides with high postsynaptic activity [5]-[6].
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.05)