philippines
Flock Uses Overseas Gig Workers to Build Its Surveillance AI
An accidental leak revealed that Flock, which has cameras in thousands of US communities, is using workers in the Philippines to review and classify footage. Flock, the automatic license plate reader and AI-powered camera company, uses overseas workers from Upwork to train its machine learning algorithms, with training material telling workers how to review and categorize footage including images people and vehicles in the United States, according to material reviewed by 404 Media that was accidentally exposed by the company. The findings bring up questions about who exactly has access to footage collected by Flock surveillance cameras and where people reviewing the footage may be based. Flock has become a pervasive technology in the US, with its cameras present in thousands of communities that cops use every day to investigate things like carjackings. Local police have also performed numerous lookups for ICE in the system.
- Asia > Philippines (0.25)
- North America > United States > New York (0.05)
- North America > United States > California (0.05)
- (6 more...)
Six new bat species discovered in the Philippines
The archipelago's total bat tally now stands at 85 different flying mammals. Breakthroughs, discoveries, and DIY tips sent every weekday. A few weeks ago, conservationists announced the milestone discovery of the 1500th known bat species, . October's Bat Appreciation Month celebrations apparently aren't done yet. With only a few days remaining before Halloween, a research team has described another new species residing in the Philippines for the journal .
- Asia > Philippines (0.64)
- North America > United States (0.31)
- North America > Canada > Ontario (0.06)
- (4 more...)
The Curious Case of Curiosity across Human Cultures and LLMs
Borah, Angana, Jin, Zhijing, Mihalcea, Rada
Recent advances in Large Language Models (LLMs) have expanded their role in human interaction, yet curiosity -- a central driver of inquiry -- remains underexplored in these systems, particularly across cultural contexts. In this work, we investigate cultural variation in curiosity using Yahoo! Answers, a real-world multi-country dataset spanning diverse topics. We introduce CUEST (CUriosity Evaluation across SocieTies), an evaluation framework that measures human-model alignment in curiosity through linguistic (style), topic preference (content) analysis and grounding insights in social science constructs. Across open- and closed-source models, we find that LLMs flatten cross-cultural diversity, aligning more closely with how curiosity is expressed in Western countries. We then explore fine-tuning strategies to induce curiosity in LLMs, narrowing the human-model alignment gap by up to 50%. Finally, we demonstrate the practical value of curiosity for LLM adaptability across cultures, showing its importance for future NLP research.
- North America > Canada > Ontario > Toronto (0.14)
- Europe > Austria > Vienna (0.14)
- South America > Brazil (0.06)
- (30 more...)
Intersectional Bias in Japanese Large Language Models from a Contextualized Perspective
Yanaka, Hitomi, He, Xinqi, Lu, Jie, Han, Namgi, Oh, Sunjin, Kumon, Ryoma, Matsuoka, Yuma, Watabe, Katsuhiko, Itatsu, Yuko
An increasing number of studies have examined the social bias of rapidly developed large language models (LLMs). Although most of these studies have focused on bias occurring in a single social attribute, research in social science has shown that social bias often occurs in the form of intersectionality -- the constitutive and contextualized perspective on bias aroused by social attributes. In this study, we construct the Japanese benchmark inter-JBBQ, designed to evaluate the intersectional bias in LLMs on the question-answering setting. Using inter-JBBQ to analyze GPT-4o and Swallow, we find that biased output varies according to its contexts even with the equal combination of social attributes.
- Asia > Philippines (0.07)
- Asia > Vietnam (0.06)
- Europe > Italy (0.06)
- (9 more...)
- Government (0.69)
- Education > Educational Setting > K-12 Education (0.51)
AI and disinformation fuel political rivalries in the Philippines
Manila, Philippines – When former Philippines President Rodrigo Duterte was arrested by the International Criminal Court (ICC) in March, Sheerah Escuerdo spoke to a local television station, welcoming the politician's detention on charges of murder linked to his war on drugs. Escuerdo, who lost her 18-year-old brother, Ephraim, to Duterte's war, clutched a portrait of her sibling during the interview with News 5 Everywhere as she demanded justice for his killing. Days later, she was shocked to find an AI-generated video of her slain brother circulating on Facebook, in which he said he was alive and accused his sister of lying. Are they paying you to do this?" the computer-generated image of Ephraim said. The video, posted online by a pro-Duterte influencer with 11,000 followers, immediately drew thousands of views on Facebook. One of the comments read, "Fake drug war victims". It was Escudero and her brother's image from her News 5 Everywhere interview that the influencer had used to ...
- Asia > Philippines > Luzon > National Capital Region > City of Manila (0.56)
- North America > United States (0.15)
- Government > Regional Government > Asia Government > Philippines Government (0.70)
- Media > News (0.62)
- Law > Criminal Law (0.55)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.55)
Hierarchical Memory Organization for Wikipedia Generation
Yu, Eugene J., Zhu, Dawei, Song, Yifan, Wong, Xiangyu, Zhang, Jiebin, Shi, Wenxuan, Li, Xiaoguang, Liu, Qun, Li, Sujian
Generating Wikipedia articles autonomously is a challenging task requiring the integration of accurate, comprehensive, and well-structured information from diverse sources. This paper introduces the Memory Organization-based Generation (MOG) framework, a novel approach to address these challenges by leveraging a hierarchical memory architecture. MOG extracts fine-grained memory units from web documents, recursively organizes them into a Wikipedia-style hierarchical structure, and uses this structure to guide the generation process. This ensures alignment between memory and the article outline, improving both informativeness and verifiability while minimizing hallucinations. Additionally, a citation module is implemented to enhance traceability by linking every generated sentence to specific memory units. Evaluations on our newly created WikiStart dataset demonstrate that MOG outperforms baseline methods in producing informative and reliable articles, making it particularly robust in real-world scenarios.
- Asia > Philippines (0.15)
- Asia > Malaysia (0.14)
- Asia > Singapore (0.06)
- (16 more...)
- Leisure & Entertainment > Sports > Olympic Games (0.68)
- Consumer Products & Services > Travel (0.68)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)
Cultural Awareness in Vision-Language Models: A Cross-Country Exploration
Madasu, Avinash, Lal, Vasudev, Howard, Phillip
Vision-Language Models (VLMs) are increasingly deployed in diverse cultural contexts, yet their internal biases remain poorly understood. In this work, we propose a novel framework to systematically evaluate how VLMs encode cultural differences and biases related to race, gender, and physical traits across countries. We introduce three retrieval-based tasks: (1) Race to Country retrieval, which examines the association between individuals from specific racial groups (East Asian, White, Middle Eastern, Latino, South Asian, and Black) and different countries; (2) Personal Traits to Country retrieval, where images are paired with trait-based prompts (e.g., Smart, Honest, Criminal, Violent) to investigate potential stereotypical associations; and (3) Physical Characteristics to Country retrieval, focusing on visual attributes like skinny, young, obese, and old to explore how physical appearances are culturally linked to nations. Our findings reveal persistent biases in VLMs, highlighting how visual representations may inadvertently reinforce societal stereotypes.
- Asia > Middle East > UAE (0.19)
- North America > United States (0.18)
- Africa > Democratic Republic of the Congo (0.15)
- (52 more...)
The Challenge of Achieving Attributability in Multilingual Table-to-Text Generation with Question-Answer Blueprints
Multilingual Natural Language Generation (NLG) is challenging due to the lack of training data for low-resource languages. However, some low-resource languages have up to tens of millions of speakers globally, making it important to improve NLG tools for them. Table-to-Text NLG is an excellent measure of models' reasoning abilities but is very challenging in the multilingual setting. System outputs are often not attributable, or faithful, to the data in the source table. Intermediate planning techniques like Question-Answer (QA) blueprints have been shown to improve attributability on summarisation tasks. This work explores whether QA blueprints make multilingual Table-to-Text outputs more attributable to the input tables. This paper extends the challenging multilingual Table-to-Text dataset, TaTA, which includes African languages, with QA blueprints. Sequence-to-sequence language models are then finetuned on this dataset, with and without blueprints. Results show that QA blueprints improve performance for models finetuned and evaluated only on English examples, but do not demonstrate gains in the multilingual setting. This is due to inaccuracies in machine translating the blueprints from English into target languages when generating the training data, and models failing to rely closely on the blueprints they generate. An in-depth analysis is conducted on why this is challenging.
- Africa > Mali (0.06)
- Asia > Philippines (0.05)
- Africa > Nigeria (0.05)
- (7 more...)
Batayan: A Filipino NLP benchmark for evaluating Large Language Models
Montalan, Jann Railey, Layacan, Jimson Paulo, Africa, David Demitri, Flores, Richell Isaiah, Lopez, Michael T. II, Magsajo, Theresa Denise, Cayabyab, Anjanette, Tjhi, William Chandra
Recent advances in large language models (LLMs) have demonstrated remarkable capabilities on widely benchmarked high-resource languages; however, linguistic nuances of under-resourced languages remain unexplored. We introduce Batayan, a holistic Filipino benchmark designed to systematically evaluate LLMs across three key natural language processing (NLP) competencies: understanding, reasoning, and generation. Batayan consolidates eight tasks, covering both Tagalog and code-switched Taglish utterances. Our rigorous, native-speaker-driven annotation process ensures fluency and authenticity to the complex morphological and syntactic structures of Filipino, alleviating a pervasive translationese bias in existing Filipino corpora. We report empirical results on a variety of multilingual LLMs, highlighting significant performance gaps that signal the under-representation of Filipino in pretraining corpora, the unique hurdles in modeling Filipino's rich morphology and construction, and the importance of explicit Filipino language support and instruction tuning. Moreover, we discuss the practical challenges encountered in dataset construction and propose principled solutions for building culturally and linguistically-faithful resources in under-represented languages. We also provide a public benchmark and leaderboard as a clear foundation for iterative, community-driven progress in Filipino NLP.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > Singapore (0.05)
- South America > Colombia > Meta Department > Villavicencio (0.04)
- (17 more...)
- Government (0.68)
- Education (0.46)
- Social Sector (0.46)
- Health & Medicine > Therapeutic Area > Immunology (0.46)
Kalahi: A handcrafted, grassroots cultural LLM evaluation suite for Filipino
Montalan, Jann Railey, Ngui, Jian Gang, Leong, Wei Qi, Susanto, Yosephine, Rengarajan, Hamsawardhini, Aji, Alham Fikri, Tjhi, William Chandra
Multilingual large language models (LLMs) today may not necessarily provide culturally appropriate and relevant responses to its Filipino users. We introduce Kalahi, a cultural LLM evaluation suite collaboratively created by native Filipino speakers. It is composed of 150 high-quality, handcrafted and nuanced prompts that test LLMs for generations that are relevant to shared Filipino cultural knowledge and values. Strong LLM performance in Kalahi indicates a model's ability to generate responses similar to what an average Filipino would say or do in a given situation. We conducted experiments on LLMs with multilingual and Filipino language support. Results show that Kalahi, while trivial for Filipinos, is challenging for LLMs, with the best model answering only 46.0% of the questions correctly compared to native Filipino performance of 89.10%. Thus, Kalahi can be used to accurately and reliably evaluate Filipino cultural representation in LLMs.
- Asia > Singapore (0.04)
- Asia > Philippines > Luzon > National Capital Region > City of Manila (0.04)
- Asia > Middle East > Jordan (0.04)
- (22 more...)
- Education (0.94)
- Health & Medicine > Therapeutic Area (0.94)
- Health & Medicine > Consumer Health (0.67)