When Subgraph Isomorphism is Really Hard, and Why This Matters for Graph Databases
Ciaran McCreesh, Patrick Prosser, Christine Solnon, James Trimble
Journal of Artificial Intelligence Research
The subgraph isomorphism problem involves deciding whether a copy of a pattern graph occurs inside a larger target graph. The non-induced version allows extra edges in the target, whilst the induced version does not. Although both variants are NP-complete, algorithms inspired by constraint programming can operate comfortably on many real-world problem instances with thousands of vertices. However, they cannot handle arbitrary instances of this size. We show how to generate "really hard" random instances for subgraph isomorphism problems, which are computationally challenging with a couple of hundred vertices in the target, and only twenty pattern vertices. For the non-induced version of the problem, these instances lie on a satisfiable / unsatisfiable phase transition, whose location we can predict; for the induced variant, much richer behaviour is observed, and constrainedness gives a better measure of difficulty than does proximity to a phase transition. These results have practical consequences: we explain why the widely researched "filter / verify" indexing technique used in graph databases is founded upon a misunderstanding of the empirical hardness of NP-complete problems, and cannot be beneficial when paired with any reasonable subgraph isomorphism algorithm.
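The distinction between the non-induced and induced variants can be illustrated with a minimal brute-force sketch. This is not the authors' method and bears no resemblance to the constraint programming algorithms the abstract refers to; the function name `has_subgraph` and the edge-list representation are assumptions made purely for illustration, and the exhaustive search is only practical for tiny graphs.

```python
from itertools import permutations

def has_subgraph(pattern_n, pattern_edges, target_n, target_edges, induced=False):
    """Brute-force check for a copy of the pattern inside the target.

    Vertices are numbered 0..n-1 and edges are undirected pairs.
    Hypothetical helper for illustration only; exponential in pattern_n.
    """
    p_adj = {frozenset(e) for e in pattern_edges}
    t_adj = {frozenset(e) for e in target_edges}
    # Try every injective assignment of pattern vertices to target vertices.
    for image in permutations(range(target_n), pattern_n):
        ok = True
        for u in range(pattern_n):
            for v in range(u + 1, pattern_n):
                in_p = frozenset((u, v)) in p_adj
                in_t = frozenset((image[u], image[v])) in t_adj
                # Non-induced: every pattern edge must map to a target edge.
                # Induced: additionally, no extra target edge may appear
                # between the images of non-adjacent pattern vertices.
                if (in_p and not in_t) or (induced and in_t and not in_p):
                    ok = False
                    break
            if not ok:
                break
        if ok:
            return True
    return False

# Pattern: a path on three vertices. Target: a triangle.
path = [(0, 1), (1, 2)]
triangle = [(0, 1), (1, 2), (0, 2)]
non_induced = has_subgraph(3, path, 3, triangle)
induced = has_subgraph(3, path, 3, triangle, induced=True)
print(non_induced, induced)  # True False
```

The path occurs in the triangle in the non-induced sense, because the extra edge in the target is allowed, but not in the induced sense, because any three triangle vertices are fully connected.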
Mar-30-2018
- Country:
- Oceania > Australia
- Victoria > Melbourne (0.04)
- New South Wales > Sydney (0.04)
- North America
- Canada > Ontario (0.04)
- United States
- Wisconsin > Dane County
- Madison (0.04)
- Washington > King County
- Seattle (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- Santa Clara County > San Jose (0.04)
- Los Angeles County > Pasadena (0.04)
- Mexico > Quintana Roo
- Cancún (0.04)
- Europe
- Hungary > Hajdú-Bihar County
- Debrecen (0.04)
- Austria
- Vienna (0.14)
- Upper Austria > Linz (0.04)
- Portugal > Porto
- Porto (0.04)
- Germany > Brandenburg
- Potsdam (0.04)
- Ireland > Munster
- County Cork > Cork (0.04)
- France
- Île-de-France > Paris
- Paris (0.04)
- Occitanie > Haute-Garonne
- Toulouse (0.04)
- Nouvelle-Aquitaine > Gironde
- Bordeaux (0.04)
- Auvergne-Rhône-Alpes > Lyon
- Lyon (0.04)
- Italy > Veneto
- Venice (0.04)
- Russia > Northwestern Federal District
- Leningrad Oblast > Saint Petersburg (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > Scotland
- City of Glasgow > Glasgow (0.04)
- Asia
- Russia (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- China > Beijing
- Beijing (0.04)
- Genre:
- Research Report > New Finding (0.45)
- Industry:
- Health & Medicine (0.46)
- Technology: