Pre-Trained Language Models for Keyphrase Prediction: A Review

Umair, Muhammad, Sultana, Tangina, Lee, Young-Koo

Sep-2-2024–arXiv.org Artificial Intelligence

Keyphrase extraction involves using a model to accurately identify and classify the keyphrases in a document. The generation of keyphrases is another task in which the model predicts both present and absent keyphrases within the context of the document, as introduced in [1]. With the application of deep learning technologies, the use of pre-trained language models (PLMs) in NLP has risen noticeably in recent years. PLMs are trained using different strategies on extensive text corpora and have shown exceptional performance in various downstream tasks, including keyphrase prediction. PLMs that use self-supervised learning differ from traditional learning methods, such as supervised learning, because they are first trained on a large volume of unlabeled data before being fine-tuned on small quantities of labeled data for specific tasks.

In the realm of NLP, BERT [2], GPT [3], and T5 [4] are some of the notable works that have consistently updated benchmark records in Pre-trained Language Model Keyphrase Extraction (PLM-KPE) and Pre-trained Language Model Keyphrase Generation (PLM-KPG) tasks [5], contributing significantly to the development of NLP. The process of extracting keyphrases from a document involves identifying significant phrases that represent the main topics or concepts discussed within it. The primary objective is to extract the most essential and representative phrases using feature-based learning [6, 7, 8, 9, 10] and linguistic techniques [11] such as frequency analysis [12], part-of-speech tagging [13, 14], and syntactic parsing [15]. These methods can identify keyphrases based on their frequency, relevance, ...
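To make the distinction between frequency-based extraction and present/absent keyphrases concrete, the following is a minimal Python sketch and not the method of any cited work. It ranks stopword-free n-grams by raw frequency and then splits a list of reference keyphrases into those that occur verbatim in the text (present) and those that do not (absent). The function names, the tiny stopword list, and the sample text are all assumptions made for illustration.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"a", "an", "the", "of", "in", "on", "and", "or", "to", "is",
             "are", "for", "with", "that", "this", "it", "as", "by", "from"}


def tokenize(text):
    """Lowercase the text and return word tokens (internal hyphens are kept)."""
    return re.findall(r"[a-z]+(?:-[a-z]+)*", text.lower())


def frequency_keyphrases(text, max_len=2, top_k=3):
    """Rank stopword-free n-grams (1 <= n <= max_len) by raw frequency."""
    tokens = tokenize(text)
    counts = Counter()
    for n in range(1, max_len + 1):
        for i in range(len(tokens) - n + 1):
            gram = tokens[i:i + n]
            # Skip any candidate that contains a stopword.
            if any(tok in STOPWORDS for tok in gram):
                continue
            counts[" ".join(gram)] += 1
    return [phrase for phrase, _ in counts.most_common(top_k)]


def split_present_absent(text, keyphrases):
    """Split keyphrases into (present, absent) by verbatim token-sequence match."""
    # Pad with spaces so that only whole-token matches count.
    haystack = " " + " ".join(tokenize(text)) + " "
    present, absent = [], []
    for phrase in keyphrases:
        needle = " " + " ".join(tokenize(phrase)) + " "
        (present if needle in haystack else absent).append(phrase)
    return present, absent


if __name__ == "__main__":
    doc = ("Pre-trained language models improve keyphrase extraction. "
           "Keyphrase extraction finds phrases in a document, while "
           "language models can also generate phrases.")
    print(frequency_keyphrases(doc, max_len=2, top_k=10))
    print(split_present_absent(doc, ["keyphrase extraction",
                                     "text summarization"]))
```

An extraction model can only return phrases from the present group, whereas a generation model is also expected to produce phrases from the absent group, which is why the two tasks are usually evaluated separately. Note that this sketch counts n-grams across sentence boundaries; a more careful version would segment sentences first.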

extraction, keyphrase, keyphrase extraction, (13 more...)


Country:
- Oceania > Australia
  - Tasmania > Hobart (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - Washington > King County
      - Seattle (0.04)
    - Texas > Travis County
      - Austin (0.04)
    - New York > New York County
      - New York City (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
  - Canada
    - Quebec (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
- Europe
  - Switzerland (0.04)
  - Ukraine > Lviv Oblast
    - Lviv (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - France
    - Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
      - Marseille (0.04)
    - Auvergne-Rhône-Alpes > Isère
      - Grenoble (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - South Korea (0.04)
  - Bangladesh (0.04)
  - Vietnam > Hanoi
    - Hanoi (0.04)
  - Singapore > Central Region
    - Singapore (0.04)
  - Middle East
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)
    - Qatar > Ad-Dawhah
      - Doha (0.04)
  - China
    - Hong Kong (0.04)
    - Shandong Province > Qingdao (0.04)

Genre:
- Overview (1.00)
- Research Report
  - New Finding (1.00)
  - Promising Solution (0.93)

Industry:
- Health & Medicine (1.00)
- Education (1.00)
- Law (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)