Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal

We study the problem of approximating edit distance in sublinear time. This is formalized as the $(k,k^c)$-Gap Edit Distance problem, where the input is a pair of strings $X,Y$ and parameters $k,c>1$, and the goal is to return YES if $ED(X,Y)\leq k$, NO if $ED(X,Y)> k^c$, and an arbitrary answ...

Full description

Bibliographic Details
Main Authors: Goldenberg, Elazar, Kociumaka, Tomasz, Krauthgamer, Robert, Saha, Barna
Format: Text
Language:unknown
Published: 2021
Subjects:
Online Access:http://arxiv.org/abs/2111.12706
id ftarxivpreprints:oai:arXiv.org:2111.12706
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:2111.12706 2023-09-05T13:22:56+02:00 Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal Goldenberg, Elazar Kociumaka, Tomasz Krauthgamer, Robert Saha, Barna 2021-11-24 http://arxiv.org/abs/2111.12706 unknown http://arxiv.org/abs/2111.12706 Computer Science - Data Structures and Algorithms text 2021 ftarxivpreprints 2023-08-16T16:48:10Z We study the problem of approximating edit distance in sublinear time. This is formalized as the $(k,k^c)$-Gap Edit Distance problem, where the input is a pair of strings $X,Y$ and parameters $k,c>1$, and the goal is to return YES if $ED(X,Y)\leq k$, NO if $ED(X,Y)> k^c$, and an arbitrary answer when $k < ED(X,Y) \le k^c$. Recent years have witnessed significant interest in designing sublinear-time algorithms for Gap Edit Distance. In this work, we resolve the non-adaptive query complexity of Gap Edit Distance for the entire range of parameters, improving over a sequence of previous results. Specifically, we design a non-adaptive algorithm with query complexity $\tilde{O}(n/k^{c-0.5})$, and we further prove that this bound is optimal up to polylogarithmic factors. Our algorithm also achieves optimal time complexity $\tilde{O}(n/k^{c-0.5})$ whenever $c\geq 1.5$. For $1<c<1.5$, the running time of our algorithm is $\tilde{O}(n/k^{2c-1})$. In the restricted case of $k^c=\Omega(n)$, this matches a known result [Batu, Erg\"un, Kilian, Magen, Raskhodnikova, Rubinfeld, and Sami; STOC 2003], and in all other (nontrivial) cases, our running time is strictly better than all previous algorithms, including the adaptive ones. However, an independent work of Bringmann, Cassis, Fischer, and Nakos [STOC 2022] provides an adaptive algorithm that bypasses the non-adaptive lower bound, but only for small enough $k$ and $c$. Comment: Accepted to FOCS 2022 Text sami ArXiv.org (Cornell University Library)
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Data Structures and Algorithms
spellingShingle Computer Science - Data Structures and Algorithms
Goldenberg, Elazar
Kociumaka, Tomasz
Krauthgamer, Robert
Saha, Barna
Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal
topic_facet Computer Science - Data Structures and Algorithms
description We study the problem of approximating edit distance in sublinear time. This is formalized as the $(k,k^c)$-Gap Edit Distance problem, where the input is a pair of strings $X,Y$ and parameters $k,c>1$, and the goal is to return YES if $ED(X,Y)\leq k$, NO if $ED(X,Y)> k^c$, and an arbitrary answer when $k < ED(X,Y) \le k^c$. Recent years have witnessed significant interest in designing sublinear-time algorithms for Gap Edit Distance. In this work, we resolve the non-adaptive query complexity of Gap Edit Distance for the entire range of parameters, improving over a sequence of previous results. Specifically, we design a non-adaptive algorithm with query complexity $\tilde{O}(n/k^{c-0.5})$, and we further prove that this bound is optimal up to polylogarithmic factors. Our algorithm also achieves optimal time complexity $\tilde{O}(n/k^{c-0.5})$ whenever $c\geq 1.5$. For $1<c<1.5$, the running time of our algorithm is $\tilde{O}(n/k^{2c-1})$. In the restricted case of $k^c=\Omega(n)$, this matches a known result [Batu, Erg\"un, Kilian, Magen, Raskhodnikova, Rubinfeld, and Sami; STOC 2003], and in all other (nontrivial) cases, our running time is strictly better than all previous algorithms, including the adaptive ones. However, an independent work of Bringmann, Cassis, Fischer, and Nakos [STOC 2022] provides an adaptive algorithm that bypasses the non-adaptive lower bound, but only for small enough $k$ and $c$. Comment: Accepted to FOCS 2022
format Text
author Goldenberg, Elazar
Kociumaka, Tomasz
Krauthgamer, Robert
Saha, Barna
author_facet Goldenberg, Elazar
Kociumaka, Tomasz
Krauthgamer, Robert
Saha, Barna
author_sort Goldenberg, Elazar
title Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal
title_short Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal
title_full Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal
title_fullStr Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal
title_full_unstemmed Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal
title_sort gap edit distance via non-adaptive queries: simple and optimal
publishDate 2021
url http://arxiv.org/abs/2111.12706
genre sami
genre_facet sami
op_relation http://arxiv.org/abs/2111.12706
_version_ 1776203503394357248