Abstract
We present a general framework for designing efficient data structures for high-dimensional pattern-matching problems (λ i ϵ [n], f(xi,y) = 1) through communication models in which f(x,y) admits sublinear communication protocols with exponentially-small error. Specifically, we reduce the data structure problem to the Unambiguous Arthur-Merlin (UAM) communication complexity of f(x,y) under product distributions. We apply our framework to the Partial Match problem (a.k.a, matching with wildcards), whose underlying communication problem is sparse set-disjointness. When the database consists of n points in dimension d, and the number of ∗'s in the query is at most w = c logn λ€.,(λ‰ d), the fastest known linear-space data structure (Cole, Gottlieb and Lewenstein, STOC'04) had query time t ≈ 2w = nc, which is nontrivial only when c<1. By contrast, our framework produces a data structure with query time n1-1/(c log2 c) and space close to linear. To achieve this, we develop a one-sided ϵ-error communication protocol for set-disjointness under product distributions with (λd log(1/ϵ)) complexity, improving on the classical result of Babai, Frankl and Simon (FOCS'86). Building on this protocol, we show that the Unambiguous AM communication complexity of w-Sparse set-disjointness with ϵ-error under product distributions is Õ(λw log(1/ϵ)), independent of the ambient dimension d, which is crucial for the partial match result. Our framework sheds further light on the power of data-dependent data structures, which is instrumental for reducing to the (much easier) case of product distributions.
| Original language | English |
|---|---|
| Title of host publication | STOC 2025 - Proceedings of the 57th Annual ACM Symposium on Theory of Computing |
| Editors | Michal Koucky, Nikhil Bansal |
| Publisher | Association for Computing Machinery |
| Pages | 256-267 |
| Number of pages | 12 |
| ISBN (Electronic) | 9798400715105 |
| DOIs | |
| State | Published - 15 Jun 2025 |
| Event | 57th Annual ACM Symposium on Theory of Computing, STOC 2025 - Prague, Czech Republic Duration: 23 Jun 2025 → 27 Jun 2025 |
Publication series
| Name | Proceedings of the Annual ACM Symposium on Theory of Computing |
|---|---|
| ISSN (Print) | 0737-8017 |
Conference
| Conference | 57th Annual ACM Symposium on Theory of Computing, STOC 2025 |
|---|---|
| Country/Territory | Czech Republic |
| City | Prague |
| Period | 23/06/25 → 27/06/25 |
Bibliographical note
Publisher Copyright:© 2025 Copyright is held by the owner/author(s). Publication rights licensed to ACM.
Keywords
- Communication Complexity
- Data Structures
- Partial Match
Fingerprint
Dive into the research topics of 'A Framework for Building Data Structures from Communication Protocols'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver