Quality control in sublinear time: a case study via random graphs
Marcussen, Cassandra, Rubinfeld, Ronitt, Sudan, Madhu
–arXiv.org Artificial Intelligence
Many algorithms are designed to work well on average over inputs. When running such an algorithm on an arbitrary input, we must ask: Can we trust the algorithm on this input? We identify a new class of algorithmic problems addressing this, which we call "Quality Control Problems." These problems are specified by a (positive, real-valued) "quality function" $ρ$ and a distribution $D$ such that, with high probability, a sample drawn from $D$ is "high quality," meaning its $ρ$-value is near $1$. The goal is to accept inputs $x \sim D$ and reject potentially adversarially generated inputs $x$ with $ρ(x)$ far from $1$. The objective of quality control is thus weaker than either component problem: testing for "$ρ(x) \approx 1$" or testing if $x \sim D$, and offers the possibility of more efficient algorithms. In this work, we consider the sublinear version of the quality control problem, where $D \in Δ(\{0,1\}^N)$ and the goal is to solve the $(D ,ρ)$-quality problem with $o(N)$ queries and time. As a case study, we consider random graphs, i.e., $D = G_{n,p}$ (and $N = \binom{n}2$), and the $k$-clique count function $ρ_k := C_k(G)/\mathbb{E}_{G' \sim G_{n,p}}[C_k(G')]$, where $C_k(G)$ is the number of $k$-cliques in $G$. Testing if $G \sim G_{n,p}$ with one sample, let alone with sublinear query access to the sample, is of course impossible. Testing if $ρ_k(G)\approx 1$ requires $p^{-Ω(k^2)}$ samples. In contrast, we show that the quality control problem for $G_{n,p}$ (with $n \geq p^{-ck}$ for some constant $c$) with respect to $ρ_k$ can be tested with $p^{-O(k)}$ queries and time, showing quality control is provably superpolynomially more efficient in this setting. More generally, for a motif $H$ of maximum degree $Δ(H)$, the respective quality control problem can be solved with $p^{-O(Δ(H))}$ queries and running time.
arXiv.org Artificial Intelligence
Sep-8-2025
- Country:
- Africa > Sudan (0.40)
- Asia > Middle East
- Israel (0.04)
- Europe
- Czechia > Prague (0.04)
- France > Île-de-France
- Germany (0.04)
- Greece > Attica
- Athens (0.04)
- Middle East > Cyprus
- Romania > Vest Development Region
- Timiș County > Timișoara (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Washington > King County
- Seattle (0.13)
- Virginia > Alexandria County
- Alexandria (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.14)
- Oregon > Multnomah County
- Portland (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Maryland > Baltimore (0.04)
- Ohio > Cuyahoga County
- Shaker Heights (0.04)
- California > San Diego County
- San Diego (0.04)
- Florida > Orange County
- Orlando (0.04)
- Pennsylvania > Allegheny County
- Canada > Quebec
- Genre:
- Research Report (0.82)
- Technology: