Machine Learning for Tangible Effects: Natural Language Processing for Uncovering the Illicit Massage Industry & Computer Vision for Tactile Sensing
–arXiv.org Artificial Intelligence
I explore two questions in this thesis: how can computer science be used to fight human trafficking? And how can computer vision create a sense of touch? I use natural language processing (NLP) to monitor the United States illicit massage industry (IMI), a multi-billion dollar industry that offers not just therapeutic massages but also commercial sexual services. Employees of this industry are often immigrant women with few job opportunities, leaving them vulnerable to fraud, coercion, and other facets of human trafficking. Monitoring spatiotemporal trends helps prevent trafficking in the IMI. By creating datasets with three publicly-accessible websites: Google Places, Rubmaps, and AMPReviews, combined with NLP techniques such as bag-of-words and Word2Vec, I show how to derive insights into the labor pressures and language barriers that employees face, as well as the income, demographics, and societal pressures affecting sex buyers. I include a call-to-action to other researchers given these datasets. I also consider how to creating synthetic financial data, which can aid with counter-trafficking in the banking sector. I use an agent-based model to create both tabular and payee-recipient graph data. I then consider the role of computer vision in making tactile sensors. I report on a novel sensor, the Digger Finger, that adapts the Gelsight sensor to finding objects in granular media. Changes include using a wedge shape to facilitate digging, replacing the internal lighting LEDs with fluorescent paint, and adding a vibrator motor to counteract jamming. Finally, I also show how to use a webcam and a printed reference marker, or fiducial, to create a low-cost six-axis force-torque sensor. This sensor is up to a hundred times less expensive than commercial sensors, allowing for a wider range of applications. For this and earlier chapters I release design files and code as open source.
arXiv.org Artificial Intelligence
Sep-7-2023
- Country:
- North America
- Canada > Ontario (0.04)
- United States
- New Jersey (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Arizona > Maricopa County
- Phoenix (0.04)
- Texas
- Harris County > Houston (0.27)
- Travis County > Austin (0.04)
- Dallas County > Dallas (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- New York
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- New York County > New York City (0.04)
- Kings County > New York City (0.04)
- Bronx County > New York City (0.04)
- Illinois > Cook County
- Chicago (0.04)
- New Mexico
- Los Alamos County > Los Alamos (0.04)
- Bernalillo County > Albuquerque (0.04)
- Pennsylvania
- Philadelphia County > Philadelphia (0.14)
- Lehigh County > Allentown (0.04)
- Massachusetts
- Suffolk County > Boston (0.04)
- Middlesex County > Cambridge (0.04)
- California
- San Francisco County > San Francisco (0.04)
- San Diego County > San Diego (0.04)
- Los Angeles County > Los Angeles (0.04)
- Europe
- Asia
- East Asia (0.04)
- China (0.04)
- Middle East
- North America
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (0.68)
- Research Report
- Industry:
- Banking & Finance (1.00)
- Law Enforcement & Public Safety
- Fraud (1.00)
- Crime Prevention & Enforcement (1.00)
- Law
- Criminal Law (1.00)
- Civil Rights & Constitutional Law (0.69)
- Information Technology
- Services (1.00)
- Security & Privacy (0.92)
- Health & Medicine
- Therapeutic Area > Infections and Infectious Diseases (0.67)
- Epidemiology (0.67)
- Government
- Immigration & Customs (1.00)
- Military (0.67)
- Regional Government > North America Government
- United States Government (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Robots > Manipulation (1.00)
- Representation & Reasoning > Agents (1.00)
- Natural Language > Text Processing (1.00)
- Machine Learning
- Performance Analysis > Accuracy (1.00)
- Neural Networks > Deep Learning (1.00)
- Statistical Learning > Regression (0.68)
- Information Technology > Artificial Intelligence