Beyond Social Media Analytics: Understanding Human Behaviour and Deep Emotion using Self Structuring Incremental Machine Learning
This thesis develops a conceptual framework considering social data as representing the surface layer of a hierarchy of human social behaviours, needs and cognition which is employed to transform social data into representations that preserve social behaviours and their causalities. Based on this framework two platforms were built to capture insights from fast-paced and slow-paced social data. For fast-paced, a self-structuring and incremental learning technique was developed to automatically capture salient topics and corresponding dynamics over time. An event detection technique was developed to automatically monitor those identified topic pathways for significant fluctuations in social behaviours using multiple indicators such as volume and sentiment. This platform is demonstrated using two large datasets with over 1 million tweets. The separated topic pathways were representative of the key topics of each entity and coherent against topic coherence measures. Identified events were validated against contemporary events reported in news. Secondly for the slow-paced social data, a suite of new machine learning and natural language processing techniques were developed to automatically capture self-disclosed information of the individuals such as demographics, emotions and timeline of personal events. This platform was trialled on a large text corpus of over 4 million posts collected from online support groups. This was further extended to transform prostate cancer related online support group discussions into a multidimensional representation and investigated the self-disclosed quality of life of patients (and partners) against time, demographics and clinical factors. The capabilities of this extended platform have been demonstrated using a text corpus collected from 10 prostate cancer online support groups comprising of 609,960 prostate cancer discussions and 22,233 patients.
Sep-5-2020
- Country:
- Oceania
- Australia > Victoria (0.04)
- New Zealand (0.04)
- North America
- Panama (0.04)
- Canada (0.04)
- United States
- District of Columbia > Washington (0.04)
- Hawaii (0.04)
- Arizona (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.13)
- New York > New York County
- New York City (0.14)
- Massachusetts > Suffolk County
- Boston (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Los Angeles County > Los Angeles (0.13)
- San Diego County > San Diego (0.04)
- Cuba > Guantánamo Province
- Guantánamo (0.04)
- Europe
- France (0.04)
- Sweden (0.04)
- Germany (0.04)
- Norway (0.04)
- Western Europe (0.04)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Belarus > Minsk Region
- Minsk (0.04)
- Middle East > Malta
- Port Region > Southern Harbour District > Valletta (0.04)
- Spain
- Netherlands
- South Holland > Rotterdam (0.04)
- North Holland > Amsterdam (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- Greater London > London (0.04)
- Ukraine
- Kyiv Oblast > Kyiv (0.04)
- Crimea (0.04)
- Asia
- Russia (0.45)
- India (0.04)
- Afghanistan (0.04)
- China (0.04)
- Middle East
- Iran (0.14)
- Saudi Arabia (0.13)
- Israel (0.04)
- Iraq (0.04)
- Jordan (0.04)
- Oceania
- Genre:
- Overview (1.00)
- Questionnaire & Opinion Survey (0.92)
- Workflow (0.92)
- Research Report
- Strength High (1.00)
- New Finding (1.00)
- Experimental Study (1.00)
- Industry:
- Health & Medicine > Therapeutic Area
- Urology (1.00)
- Psychiatry/Psychology > Mental Health (1.00)
- Neurology (1.00)
- Infections and Infectious Diseases (1.00)
- Oncology > Prostate Cancer (0.89)
- Government > Regional Government
- Health & Medicine > Therapeutic Area
- Technology:
- Information Technology
- Communications > Social Media (1.00)
- Artificial Intelligence
- Cognitive Science > Emotion (0.93)
- Representation & Reasoning
- Rule-Based Reasoning (1.00)
- Ontologies (0.92)
- Natural Language
- Text Processing (1.00)
- Information Retrieval (1.00)
- Information Extraction (1.00)
- Discourse & Dialogue (1.00)
- Machine Learning
- Statistical Learning > Clustering (1.00)
- Performance Analysis (0.92)
- Neural Networks > Deep Learning (0.67)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.45)
- Information Technology