Keyword: random projections : Search

research-article

Open Access

Better Graph Embeddings for Enterprise Graphs

CODS-COMAD '24: Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)Pages 368–374https://doi.org/10.1145/3632410.3632412

Graph embeddings are scalable and performant node representations in a graph. Fast Random Projections (FastRP) is claimed to be thousands of times faster to generate embeddings compared to random walk-based algorithms like DeepWalk and Node2Vec, while ...

research-article

Open Access

Depth-𝑑 Threshold Circuits vs. Depth-(𝑑+1) AND-OR Trees

STOC 2023: Proceedings of the 55th Annual ACM Symposium on Theory of ComputingPages 895–904https://doi.org/10.1145/3564246.3585216

For any n ∈ ℕ and d = o(loglog(n)), we prove that there is a Boolean function F on n bits and a value γ = 2^−Θ(d) such that F can be computed by a uniform depth-(d + 1) AC⁰ circuit with O(n) wires, but F cannot be computed by any depth-d TC⁰ circuit ...

research-article

Estimating Leverage Scores via Rank Revealing Methods and Randomization

SIAM Journal on Matrix Analysis and Applications (SIMAX), Volume 42, Issue 3Pages 1199–1228https://doi.org/10.1137/20M1314471

We study algorithms for estimating the statistical leverage scores of rectangular dense or sparse matrices of arbitrary rank. Our approach is based on combining rank revealing methods with compositions of dense and sparse randomized dimensionality reduction ...

research-article

Free

Randomized tests for high-dimensional regression: a more efficient and powerful solution

NIPS '20: Proceedings of the 34th International Conference on Neural Information Processing SystemsArticle No.: 396, Pages 4721–4732

We investigate the problem of testing the global null in the high-dimensional regression models when the feature dimension p grows proportionally to the number of observations n. Despite a number of prior work studying this problem, whether there exists ...

research-article

Free

Sparse projection oblique randomer forests

The Journal of Machine Learning Research (JMLR), Volume 21, Issue 1Article No.: 104, Pages 4193–4231

Decision forests, including Random Forests and Gradient Boosting Trees, have recently demonstrated state-of-the-art performance in a variety of machine learning settings. Decision forests are typically ensembles of axis-aligned decision trees; that is, ...

research-article

Oblivious dimension reduction for k-means: beyond subspaces and the Johnson-Lindenstrauss lemma

STOC 2019: Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of ComputingPages 1039–1050https://doi.org/10.1145/3313276.3316318

We show that for n points in d-dimensional Euclidean space, a data oblivious random projection of the columns onto m∈ O((logk+loglogn)ε⁻⁶log1/ε) dimensions is sufficient to approximate the cost of all k-means clusterings up to a multiplicative (1±ε) ...

research-article

Public Access

Optimal terminal dimensionality reduction in Euclidean space

STOC 2019: Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of ComputingPages 1064–1069https://doi.org/10.1145/3313276.3316307

Let ε∈(0,1) and X⊂^d be arbitrary with |X| having size n>1. The Johnson-Lindenstrauss lemma states there exists f:X→^m with m = O(ε⁻²logn) such that <table><tr><td> ∀ x∈ X ∀ y∈ X, ||x−y||₂ ≤ ||f(x)−f(y)||₂ ≤ (1+ε)||x−y||₂ . </td></tr></table> We show that ...

research-article

Public Access

An Average-Case Depth Hierarchy Theorem for Boolean Circuits

Journal of the ACM (JACM), Volume 64, Issue 5Article No.: 35, Pages 1–27https://doi.org/10.1145/3095799

We prove an average-case depth hierarchy theorem for Boolean circuits over the standard basis of AND, OR, and NOT gates. Our hierarchy theorem says that for every d ≥ 2, there is an explicit n-variable Boolean function f, computed by a linear-size depth-...

article

Free

Adaptive randomized dimension reduction on massive data

The Journal of Machine Learning Research (JMLR), Volume 18, Issue 1Pages 5134–5163

The scalability of statistical estimators is of increasing importance in modern applications. One approach to implementing scalable algorithms is to compress data into a low dimensional latent space using dimension reduction methods. In this paper, we ...

research-article

Poly-logarithmic Frege depth lower bounds via an expander switching lemma

STOC '16: Proceedings of the forty-eighth annual ACM symposium on Theory of ComputingPages 644–657https://doi.org/10.1145/2897518.2897637

We show that any polynomial-size Frege refutation of a certain linear-size unsatisfiable 3-CNF formula over n variables must have depth Ω(√logn). This is an exponential improvement over the previous best results (Pitassi et al. 1993, Krajíček et al. ...

research-article

Public Access

Near-optimal small-depth lower bounds for small distance connectivity

STOC '16: Proceedings of the forty-eighth annual ACM symposium on Theory of ComputingPages 612–625https://doi.org/10.1145/2897518.2897534

We show that any depth-d circuit for determining whether an n-node graph has an s-to-t path of length at most k must have size n^{Ω(k^1/d/d)} when k(n) ≤ n^1/5, and n^{Ω(k^1/5d/d)} when k(n)≤ n. The previous best circuit size lower bounds were n^{k^exp(−O(d))} (by ...

article

Toward large-scale continuous eda: A random matrix theory perspective

Evolutionary Computation (EVOL), Volume 24, Issue 2Pages 255–291https://doi.org/10.1162/EVCO_a_00150

Estimations of distribution algorithms EDAs are a major branch of evolutionary algorithms EA with some unique advantages in principle. They are able to take advantage of correlation structure to drive the search more efficiently, and they are able to ...

research-article

Soft Content Fingerprinting With Bit Polarization Based on Sign-Magnitude Decomposition

IEEE Transactions on Information Forensics and Security (TIFS), Volume 10, Issue 10Pages 2033–2047https://doi.org/10.1109/TIFS.2015.2432744

Content identification based on digital content fingerprinting attracts significant attention in different emerging applications. In this paper, we consider content identification based on the sign-magnitude decomposition of fingerprint codewords and ...

research-article

A Quantized Johnson–Lindenstrauss Lemma: The Finding of Buffon’s Needle

Laurent Jacques

IEEE Transactions on Information Theory (ITHR), Volume 61, Issue 9Pages 5012–5027https://doi.org/10.1109/TIT.2015.2453355

In 1733, Georges-Louis Leclerc, Comte de Buffon in France, set the ground of geometric probability theory by defining an enlightening problem: what is the probability that a needle thrown randomly on a ground made of equispaced parallel strips lies on two ...

research-article

SATTVA: SpArsiTy inspired classificaTion of malware VAriants

IH&MMSec '15: Proceedings of the 3rd ACM Workshop on Information Hiding and Multimedia SecurityPages 135–140https://doi.org/10.1145/2756601.2756616

There is an alarming increase in the amount of malware that is generated today. However, several studies have shown that most of these new malware are just variants of existing ones. Fast detection of these variants plays an effective role in thwarting ...

research-article

Parallel Streaming Signature EM-tree: A Clustering Algorithm for Web Scale Applications

WWW '15: Proceedings of the 24th International Conference on World Wide WebPages 216–226https://doi.org/10.1145/2736277.2741111

The proliferation of the web presents an unsolved problem of automatically analyzing billions of pages of natural language. We introduce a scalable algorithm that clusters hundreds of millions of web pages into hundreds of thousands of clusters. It does ...

research-article

Solving Linear SVMs with Multiple 1D Projections

CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge ManagementPages 221–230https://doi.org/10.1145/2661829.2661994

We present a new methodology for solving linear Support Vector Machines (SVMs) that capitalizes on multiple 1D projections. We show that the approach approximates the optimal solution with high accuracy and comes with analytical guarantees. Our solution ...

Article

Distributed Compressive Detection with Perfect Secrecy

MASS '14: Proceedings of the 2014 IEEE 11th International Conference on Mobile Ad Hoc and Sensor SystemsPages 674–679https://doi.org/10.1109/MASS.2014.40

This paper considers the problem of distributed compressive detection under a perfect secrecy constraint. More specifically, we consider the problem where the distributed inference network operates in the presence of an eavesdropper who wants to ...

Article

Using Projection Kurtosis Concentration of Natural Images for Blind Noise Covariance Matrix Estimation

CVPR '14: Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern RecognitionPages 2870–2876https://doi.org/10.1109/CVPR.2014.367

Kurtosis of 1D projections provides important statistical characteristics of natural images. In this work, we first provide a theoretical underpinning to a recently observed phenomenon known as projection kurtosis concentration that the kurtosis of ...

article

Free

Efficient learning and planning with compressed predictive states

The Journal of Machine Learning Research (JMLR), Volume 15, Issue 1Pages 3395–3439

Predictive state representations (PSRs) offer an expressive framework for modelling partially observable systems. By compactly representing systems as functions of observable quantities, the PSR learning approach avoids using local-minima prone ...

Search Results

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Better Graph Embeddings for Enterprise Graphs

Depth-𝑑 Threshold Circuits vs. Depth-(𝑑+1) AND-OR Trees

Estimating Leverage Scores via Rank Revealing Methods and Randomization

Randomized tests for high-dimensional regression: a more efficient and powerful solution

Sparse projection oblique randomer forests

Upcoming Conferences

Oblivious dimension reduction for k-means: beyond subspaces and the Johnson-Lindenstrauss lemma

Optimal terminal dimensionality reduction in Euclidean space

An Average-Case Depth Hierarchy Theorem for Boolean Circuits

Adaptive randomized dimension reduction on massive data

Poly-logarithmic Frege depth lower bounds via an expander switching lemma

Near-optimal small-depth lower bounds for small distance connectivity

Toward large-scale continuous eda: A random matrix theory perspective

Soft Content Fingerprinting With Bit Polarization Based on Sign-Magnitude Decomposition

A Quantized Johnson–Lindenstrauss Lemma: The Finding of Buffon’s Needle

SATTVA: SpArsiTy inspired classificaTion of malware VAriants

Parallel Streaming Signature EM-tree: A Clustering Algorithm for Web Scale Applications

Solving Linear SVMs with Multiple 1D Projections

Distributed Compressive Detection with Perfect Secrecy

Using Projection Kurtosis Concentration of Natural Images for Blind Noise Covariance Matrix Estimation

Efficient learning and planning with compressed predictive states

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

Upcoming Conferences