Efficiently Factorizing Boolean Matrices using Proximal Gradient Descent
Dalleiger, Sebastian, Vreeken, Jilles
–arXiv.org Artificial Intelligence
Addressing the interpretability problem of NMF on Boolean data, Boolean Matrix Factorization (BMF) uses Boolean algebra to decompose the input into low-rank Boolean factor matrices. These matrices are highly interpretable and very useful in practice, but they come at the high computational cost of solving an NP-hard combinatorial optimization problem. To reduce the computational burden, we propose to relax BMF continuously using a novel elastic-binary regularizer, from which we derive a proximal gradient algorithm. Through an extensive set of experiments, we demonstrate that our method works well in practice: On synthetic data, we show that it converges quickly, recovers the ground truth precisely, and estimates the simulated rank exactly. On real-world data, we improve upon the state of the art in recall, loss, and runtime, and a case study from the medical domain confirms that our results are easily interpretable and semantically meaningful.
arXiv.org Artificial Intelligence
Jul-14-2023
- Country:
- Asia
- Middle East > Israel
- Haifa District > Haifa (0.04)
- Singapore (0.04)
- Middle East > Israel
- Europe
- Bulgaria > Varna Province
- Varna (0.04)
- Germany > Saarland (0.04)
- North Macedonia > Skopje Statistical Region
- Skopje Municipality > Skopje (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Bulgaria > Varna Province
- North America
- Canada
- Alberta > Census Division No. 6
- Calgary Metropolitan Region > Calgary (0.04)
- Nova Scotia > Halifax Regional Municipality
- Halifax (0.04)
- Quebec > Montreal (0.04)
- Alberta > Census Division No. 6
- United States
- California
- Los Angeles County > Long Beach (0.04)
- Monterey County > Pacific Grove (0.04)
- San Diego County > San Diego (0.04)
- Colorado > Denver County
- Denver (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Nebraska > Douglas County
- Omaha (0.04)
- New York
- Bronx County > New York City (0.04)
- Kings County > New York City (0.04)
- New York County > New York City (0.14)
- Queens County > New York City (0.04)
- Richmond County > New York City (0.04)
- California
- Canada
- Oceania
- Australia > New South Wales
- Sydney (0.04)
- New Zealand > North Island
- Auckland Region > Auckland (0.04)
- Australia > New South Wales
- Asia
- Genre:
- Research Report > New Finding (0.48)
- Industry:
- Health & Medicine
- Pharmaceuticals & Biotechnology (0.93)
- Therapeutic Area > Oncology (1.00)
- Information Technology (0.69)
- Health & Medicine
- Technology: