WikiSeq: Mining Maximally Informative Simple Sequences from Wikipedia

Nair, Goutam (International Institute of Information Technology, Hyderabad) | Pudi, Vikram (International Institute of Information Technology, Hyderabad)

Feb-4-2017–AAAI Conferences

The problem of ordering documents in a large collection into a sequence that is efficient for learning (both human and machine) is of high practical significance, but has not yet been well-formulated. We formulate this problem as mining a maximally informative simple sequence of documents. The mined sequence should be maximally informative in the sense that the reader learns quickly by reading only a few documents, and it should be simple so that the reader is not overwhelmed while trying to learn the content. The task can be posed as: Given that a reader wishes to read (at most) k documents, which documents should be selected from the repository and in what order, so as to provide maximum information. We present the WikiSeq algorithm for this purpose. We also design a metric based on information-gain to help objectively evaluate WikiSeq, and conduct experiments to compare with indicative baselines. Finally, we provide case-studies to subjectively illustrate WikiSeq’s merits.

artificial intelligence, machine learning, natural language, (16 more...)

AAAI Conferences

Feb-4-2017

Conferences PDF

Add feedback

Country:
- North America > United States
  - Nevada (0.04)
  - California > San Diego County
    - San Diego (0.04)
- Europe
  - Denmark (0.04)
  - Czechia > Prague (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
- Asia
  - Middle East > Jordan (0.05)
  - Japan > Hokkaidō
    - Hokkaidō Prefecture > Sapporo (0.04)
  - India > Telangana
    - Hyderabad (0.04)
  - China
    - Zhejiang Province > Hangzhou (0.04)
    - Beijing > Beijing (0.04)

Genre:
- Research Report (0.47)

Industry:
- Education (0.93)

Technology:
- Information Technology
  - Communications (1.00)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning (1.00)
    - Representation & Reasoning > Search (0.69)
    - Cognitive Science (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found