My main interests are in complexity theory, pseudorandomness, algorithms, learning theory and data mining.
More generally, I like probability and combinatorics related things. Here's my
CV. Here are some of my
talks.
Theory Stuff
Pseudorandomness from Shrinkage
Manuscript. Russell Impagliazzo, Raghu Meka, David Zuckerman
ABSTRACT ECCC
One powerful theme in complexity theory and pseudorandomness in the past few decades has been the use of lower bounds to give pseudorandom generators (PRGs). However, the general results using this hardness vs. randomness paradigm suffer a quantitative loss in parameters, and hence do not give nontrivial implications for models where we only know lower bounds of a fixed polynomial. We show that when such lower bounds are proved using random restrictions, we can indeed construct PRGs which are essentially best possible without in turn improving the lower bounds.
More specifically, say that a circuit family has shrinkage exponent Gamma if a random restriction leaving a p fraction of variables unset shrinks the size of any circuit in the family by a factor of p^Gamma. Our PRG uses a seed of length roughly s^{1/(Gamma+1)} to fool circuits in the family of size s. By instantiating this generic construction, we get PRGs for the following classes:
1) de Morgan formulas of size s, seed length s^{1/3+o(1)}.
2) Formulas over an arbitrary basis of size s, seed length s^{1/2+o(1)}.
3) Read-once formulas, seed length s^{.234...}.
4) Branching programs of size s, seed length s^{1/2+o(1)}.
The previous best PRGs known for these classes used seeds of length bigger than n/2 to output n bits, and worked only when the size s=O(n).
Learning Functions of Halfspaces using Prefix Covers
COLT 2012. Parikshit Gopalan, Adam Klivans, Raghu Meka
Constructive Discrepancy Minimization by Walking on The Edges
Manuscript. Shachar Lovett, Raghu Meka
ABSTRACT arXiv
Minimizing the discrepancy of a set system is a fundamental problem in combinatorics. One of the cornerstones in this area is the celebrated six standard deviations result of Spencer (AMS 1985): In any system of n sets in a universe of size n, there always exists a coloring which achieves discrepancy 6\sqrt{n}. The original proof of Spencer was existential in nature, and did not give an efficient algorithm to find such a coloring. Recently, a breakthrough work of Bansal (FOCS 2010) gave an efficient algorithm which finds such a coloring. His algorithm was based on an SDP relaxation of the discrepancy problem and a clever rounding procedure. In this work we give a new randomized algorithm to find a coloring as in Spencer's result based on a restricted random walk we call "Edge-Walk". Our algorithm and its analysis use only basic linear algebra and is "truly" constructive in that it does not appeal to the existential arguments, giving a new proof of Spencer's theorem and the partial coloring lemma.
A PTAS for Computing the Supremum of Gaussian Processes
Manuscript. Raghu Meka
ABSTRACT arXiv
We give a polynomial time approximation scheme (PTAS) for computing the supremum of a Gaussian process. That is, given a finite set of d-dimensional vectors V, we compute a (1+epsilon)-factor approximation to Ex_{X standard Gaussian}[sup_{v in V}||] deterministically in time poly(d) |V|^{O_epsilon(1)}. Previously, only a constant factor deterministic polynomial time approximation algorithm was known due to the work of Ding, Lee and Peres. This answers an open question of Lee and Ding.
The study of supremum of Gaussian processes is of considerable importance in probability with applications in functional analysis, convex geometry, and in light of the recent work of Ding, Lee and Peres, to random walks on finite graphs. As such our result could be of use elsewhere. In particular, combining with the recent work of Ding, our result yields a PTAS for computing the cover time of bounded degree graphs. Previously, such algorithms were known only for trees.
Along the way, we also give an explicit oblivious linear estimator for semi-norms in Gaussian space with optimal query complexity.
DNF Sparsification and Faster Deterministic Counting
CCC 2012. Parikshit Gopalan, Raghu Meka, Omer Reingold
Invited to Computational Complexity Special Issue on CCC 2012
ABSTRACT ECCC
We give a faster deterministic algorithm for approximately counting the number of satisfying solutions to a DNF or CNF. Given a DNF (or CNF) f on n variables we give a deterministic n^{\tilde{O}((log log n)^2)} time algorithm that computes an (additive) epsilon approximation to the fraction of satisfying assignments of f for epsilon = 1/poly(log n). The previous best algorithm due to Luby and Velickovic from nearly two decades ago had a run-time of n^{exp(O(sqrt{log log n}))}.
A crucial ingredient in our algorithm is a structural result which allows us to sparsify any small-width DNF formula. It says that any width w DNF (irrespective of the number of terms) can be epsilon-approximated by a width w DNF with at most (w log(1/epsilon))^{O(w)} terms. Further, our approximating DNFs have an additional ``sandwiching'' property which is crucial for applications to derandomization.
Making the long code shorter, with applications to the Unique Games Conjecture
Manuscript. Boaz Barak, Parikshit Gopalan, Johan Hastad, Raghu Meka, Prasad Raghavendra, David Steurer
ABSTRACT ECCC
The long code is a central tool in hardness of approximation, especially in
questions related to the unique games conjecture. We construct a new code that
is exponentially more ecient, but can still be used in many of these applications.
Using the new code we obtain exponential improvements over several known results,
including the following:
(1) We construct a small set expander graph with $exp(log^{Omega(1)} n)$ large eigenvalues. This answers an open question of Arora, Barak and Steurer (FOCS 2010) who asked whether one can improve over the noise graph on the Boolean hypercube that has poly(log n) such eigenvalues.
(2) A gadget that reduces unique games instances with linear constraints modulo K into instances with constant size alphabet space with a quasipolynomial (in K) blowup, improving over the previously known gadget with blowup of exp(K).
(3) An n variable integrality gap for Unique Games that that survives exp(poly(loglogn)) rounds of the SDP + Sherali Adams hierarchy, improving on the previously known bound of poly(loglogn).
We show a connection between the local testability of linear codes and small set
expansion in certain related Cayley graphs, and use this connection to derandomize
the noise graph on the Boolean hypercube.
PTAS for Knapsack and Related Problems using Branching Programs
FOCS 2011. Parikshit Gopalan, Adam Klivans and Raghu Meka
Conference version to be merged with
this paper by Daniel Stefankovich, Santhosh Vempala and Eric Vigoda.
ABSTRACT ECCC
We give a deterministic, polynomial-time algorithm for approximately counting the number of {0,1}-solutions to any instance of the knapsack problem. On an instance of length n with total weight W and accuracy parameter eps, our algorithm produces a (1 + eps)-multiplicative approximation in time poly(n,log W,1/eps). We also give algorithms with identical guarantees for general integer knapsack, the multidimensional knapsack problem (with a constant number of constraints) and for contingency tables (with a constant number of rows). Previously, only randomized approximation schemes were known for these problems due to work by Morris and Sinclair and work by Dyer.
Our algorithms work by constructing small-width, read-once branching programs for approximating the underlying solution space under a carefully chosen distribution. As a byproduct of this approach, we obtain new query algorithms for learning functions of k halfspaces with respect to the uniform distribution on {0,1}^n. The running time of our algorithm is polynomial in the accuracy parameter eps. Previously even for the case of k=2, only algorithms with an exponential dependence on eps were known.
Almost Optimal Explicit Johnson-Lindenstrauss Transformations
Random 2011. Daniel Kane, Raghu Meka and Jelani Nelson
Pseudorandom Generators for Combinatorial Shapes
STOC 2011. Parikshit Gopalan, Raghu Meka, Omer Reingold and David Zuckerman
ABSTRACT ECCC
We construct pseudorandom generators for {combinatorial shapes}, which substantially generalize combinatorial rectangles, small-bias spaces, 0/1 halfspaces, and 0/1 modular sums. A Boolean function f with domain [m]^n is an (m,n)-combinatorial shape if there exist sets A_1,\ldots,A_n \subseteq [m] such that f is given by a symmetric Boolean function h applied to the arguments 1(x_1 in A_1), 1(x_2 in A_2),..., 1(x_n in A_n). Our generator uses seed length O(log m + log n + log^2(1/eps)) to get error eps. When m =2, this gives the first generator of seed length O(log n) which fools all weight-based tests, meaning that the distribution of the weight of any subset is eps-close to the appropriate binomial distribution in statistical distance.
For our proof we give a simple lemma which allows us to convert closeness in Kolmogorov (cdf) distance to closeness in statistical distance. As a corollary of our technique, we give an alternative proof of a powerful variant of the classical central limit theorem showing convergence in statistical distance, instead of the usual Kolmogorov distance.
An Invariance Principle for Polytopes
STOC 2010. Prahladh Harsha, Adam Klivans and Raghu Meka
ABSTRACT arXiv ECCC
Let X be randomly chosen from {-1,1}^n, and let Y be randomly chosen from the standard spherical Gaussian on R^n. For any (possibly unbounded) polytope P formed by the intersection of k halfspaces, we prove that
      |Pr [X belongs to P] - Pr [Y belongs to P]| < log^{8/5}k * Delta,
where Delta is a parameter that is small for polytopes formed by the intersection of "regular" halfspaces (i.e., halfspaces with low influence).
The novelty of our invariance principle is the polylogarithmic dependence on k. Previously, only bounds that were at least linear in k were known. We give two important applications of our main result:
(1) A polylogarithmic in k bound on the Boolean noise sensitivity of intersections of k "regular" halfspaces (previous work gave bounds linear in k).
(2) A pseudorandom generator (PRG) with seed length O((log n)*poly(log k,1/delta)) that delta-fools all polytopes with k faces with respect to the Gaussian distribution.
We also obtain PRGs with similar parameters that fool polytopes formed by intersection of regular halfspaces over the hypercube. Using our PRG constructions, we obtain the first deterministic quasi-polynomial time algorithms for approximately counting the number of solutions to a broad class of integer programs, including dense covering problems and contingency tables.
Pseudorandom Generators for Polynomial Threshold Functions
STOC 2010. Raghu Meka and David Zuckerman
Invited to SICOMP Special Issue on STOC 2010
ABSTRACT arXiv
We study the natural question of constructing pseudorandom generators (PRGs) for low-degree polynomial threshold functions (PTFs). We give a PRG with seed-length log n/eps^{O(d)} fooling degree d PTFs with error at most eps. Previously, no nontrivial constructions were known even for quadratic threshold functions and constant error eps. For the class of degree 1 threshold functions or halfspaces, we construct PRGs with much better dependence on the error parameter eps and obtain the following results.
1) A PRG with seed length O(log n log(1/eps)) for eps > 1/poly(n).
2) A PRG with seed length O(log n) for eps > 1/poly(log n).
Previously, only PRGs with seed length O(log n log^2(1/eps)/eps^2) were known for halfspaces. We also obtain PRGs with similar seed lengths for fooling halfspaces over the n-dimensional unit sphere.
The main theme of our constructions and analysis is the use of invariance principles to construct pseudorandom generators. We also introduce the notion of monotone read-once branching programs, which is key to improving the dependence on the error rate eps for halfspaces. These techniques may be of independent interest.
Bounding the Sensitivity of Polynomial Threshold Functions
STOC 2010. Prahladh Harsha, Adam Klivans and Raghu Meka.
Conference version to be merged with
this paper by Ilias Diakonikolas, Prasad Raghavendra, Rocco A. Servedio, Li-Yang Tan.
ABSTRACT arXiv
We give the first non-trivial upper bounds on the average sensitivity and noise sensitivity of polynomial threshold functions. More specifically, for a Boolean function f on n variables equal to the sign of a real, multivariate polynomial of total degree d we prove
1) The average sensitivity of f is at most O(n^{1-1/(4d+3)}) (we also give a simple combinatorial proof of the bound O(n^{1-1/2^d}).
2) The noise sensitivity of f with noise rate \delta is at most O(\delta^{1/(4d+6)}).
Previously, only bounds for the linear case were known. Along the way we show new structural theorems about random restrictions of polynomial threshold functions obtained via hypercontractivity. These structural results may be of independent interest as they provide a generic template for transforming problems related to polynomial threshold functions defined on the Boolean hypercube to polynomial threshold functions defined in Gaussian space.
Small-Bias Spaces for Group Products
Random 2009. Raghu Meka and David Zuckerman
ABSTRACT PDF
Small-bias, or $\epsilon$-biased, spaces have found many applications in complexity theory, coding theory, and derandomization. We generalize the notion of small-bias spaces to the setting of group products. Besides being natural, our extension captures some of the difficulties in constructing pseudorandom generators for constant-width branching programs - a longstanding open problem. We provide an efficient deterministic construction of small-bias spaces for solvable groups. Our construction exploits the fact that solvable groups have nontrivial normal subgroups that are abelian and builds on the construction of Azar et al.~\cite{azarmotwani} for abelian groups. For arbitrary finite groups, we give an efficient deterministic construction achieving constant bias. We also construct pseudorandom generators fooling linear functions mod $p$ for primes $p$.
Datamining Stuff
Guaranteed Rank Minimization via Singular Value Projection
NIPS 2010. Prateek Jain, Raghu Meka and Inderjit Dhillon.
ABSTRACT arXiv Code
Minimizing the rank of a matrix subject to affine constraints is a fundamental problem with many important applications in machine learning and statistics. In this paper we propose a simple and fast algorithm SVP (Singular Value Projection) for rank minimization with affine constraints (ARMP) and show that SVP recovers the minimum rank solution for affine constraints that satisfy the "restricted isometry property" and show robustness of our method to noise. Our results improve upon a recent breakthrough by Recht, Fazel and Parillo (RFP07) and Lee and Bresler (LB09) in
three significant ways:
1) our method (SVP) is significantly simpler to analyze and easier to implement,
2) we give recovery guarantees under strictly weaker isometry assumptions
3) we give geometric convergence guarantees for SVP even in presense of noise and, as demonstrated empirically, SVP is significantly faster on real-world and
synthetic problems.
In addition, we address the practically important problem of low-rank matrix completion (MCP), which can be seen as a special case of ARMP. We empirically demonstrate that our algorithm recovers low-rank incoherent matrices from an almost optimal number of uniformly sampled entries. We make partial progress towards proving exact recovery and provide some intuition for the strong performance of SVP applied to matrix completion by showing a more restricted isometry property. Our algorithm outperforms existing methods, such as those of \cite{RFP07,CR08,CT09,CCS08,KOM09,LB09}, for ARMP and the matrix-completion problem by an order of magnitude and is also significantly more robust to noise.
Matrix Completion from Power-Law Distributed Samples
NIPS 2009. Raghu Meka, Prateek Jain and Inderjit Dhillon.
ABSTRACT PDF
The low-rank matrix completion problem is a fundamental problem with many important applications. Recently, \cite{candesrecht},\cite{montanari} and \cite{candestao} obtained the first non-trivial theoretical results for the problem assuming that the observed entries are sampled uniformly at random. Unfortunately, most real-world datasets do not satisfy this assumption, but instead exhibit power-law distributed samples. In this paper, we propose a graph theoretic approach to matrix completion that solves the problem for more realistic sampling models. Our method is easier to analyze than previous methods with the analysis reducing to computing the threshold for {\sl complete cascades} in random graphs, a problem of independent interest. By analyzing the graph theoretic problem, we show that our method achieves exact recovery when the observed entries are sampled from the Chung-Lu-Vu model, which can generate power-law distributed graphs. We also hypothesize that our algorithm solves the matrix completion problem from an optimal number of entries for the popular preferential attachment model and provide strong empirical evidence for the claim. Furthermore, our method is easier to implement and is substantially faster than existing methods. We demonstrate the effectiveness of our method on examples when the low-rank matrix is sampled according to the prevalent random graph models for complex networks and also on the Netflix challenge dataset.
Rank Minimization via Online Learning
ICML 2008. Raghu Meka, Prateek Jain, Constantine Caramanis and Inderjit Dhillon
ABSTRACT BibTex PDF
Minimum rank problems arise frequently in machine learning applications and are notoriously difficult to solve due to the non-convex nature of the rank objective. In this paper, we present the first online learning approach for the problem of rank minimization of matrices over polyhedral sets. In particular, we present two online learning algorithms for rank minimization - our first algorithm is a multiplicative update method based on a generalized experts framework, while our second algorithm is a novel application of the online convex programming framework \cite{zinkevich}. In the latter, we flip the role of the decision maker by making the decision maker search over the constraint space instead of feasible points, as is usually the case in online convex programming. A salient feature of our online learning approach is that it allows us to give provable approximation guarantees for the rank minimization problem over polyhedral sets. We demonstrate the effectiveness of our methods on synthetic examples, and on the real-life application of low-rank kernel learning.
Simultaneous Unsupervised Learning of Disparate Clusterings
SDM 2008. Prateek Jain, Raghu Meka and Inderjit Dhillon.
Journal Version: Statistical Analysis and Data Mining, Volume 1, Issue 3.
ABSTRACT BibTex PDF Best Paper Runner-up
Most clustering algorithms produce a single clustering for a given data set even when the data can be clustered naturally in multiple ways. In this paper, we address the difficult problem of uncovering disparate clusterings from the data in a {\em totally unsupervised manner}. We propose two new approaches for this problem. In the first approach we aim to find good clusterings of the data that are also {\em decorrelated} with one another. To this end, we give a new and tractable characterization of decorrelation between clusterings, and present an objective function to capture it. We provide an iterative ``decorrelated'' $k$-means type algorithm to minimize this objective function. In the second approach, we model the data as a sum of mixtures and associate each mixture with a clustering. This approach leads us to the problem of learning a convolution of mixture distributions. Though the latter problem can be formulated as one of factorial learning \cite{zoubin,hinton,mcvq}, the existing formulations and methods do not perform well on many real high-dimensional data sets. We propose a new regularized factorial learning framework that is more suitable for capturing the notion of disparate clusterings in modern, high-dimensional data sets. The resulting algorithm does well in uncovering multiple clusterings, and is much improved over existing methods.