Skip to main content

Showing 1–12 of 12 results for author: Cummings, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.07957  [pdf, ps, other

    eess.IV cs.CV cs.LG cs.SD eess.AS

    Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models

    Authors: Dan Jacobellis, Daniel Cummings, Neeraja J. Yadwadkar

    Abstract: In the field of neural data compression, the prevailing focus has been on optimizing algorithms for either classical distortion metrics, such as PSNR or SSIM, or human perceptual quality. With increasing amounts of data consumed by machines rather than humans, a new paradigm of machine-oriented compression$\unicode{x2013}$which prioritizes the retention of features salient for machine perception o… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 10 pages; abridged version published in IEEE Data Compression Conference 2024

  2. arXiv:2312.05978  [pdf, other

    cs.LG

    Neural Architecture Codesign for Fast Bragg Peak Analysis

    Authors: Luke McDermott, Jason Weitz, Dmitri Demler, Daniel Cummings, Nhan Tran, Javier Duarte

    Abstract: We develop an automated pipeline to streamline neural architecture codesign for fast, real-time Bragg peak analysis in high-energy diffraction microscopy. Traditional approaches, notably pseudo-Voigt fitting, demand significant computational resources, prompting interest in deep learning models for more efficient solutions. Our method employs neural architecture search and AutoML to enhance these… ▽ More

    Submitted 11 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: To appear in 3rd Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE)

    Report number: FERMILAB-CONF-23-0813-CSAID-PPD

  3. arXiv:2310.18812  [pdf, other

    cs.CV

    UniCat: Crafting a Stronger Fusion Baseline for Multimodal Re-Identification

    Authors: Jennifer Crawford, Haoli Yin, Luke McDermott, Daniel Cummings

    Abstract: Multimodal Re-Identification (ReID) is a popular retrieval task that aims to re-identify objects across diverse data streams, prompting many researchers to integrate multiple modalities into a unified representation. While such fusion promises a holistic view, our investigations shed light on potential pitfalls. We uncover that prevailing late-fusion techniques often produce suboptimal latent repr… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted NeurIPS 2023 UniReps, 9 pages, 4 tables

  4. arXiv:2310.18769  [pdf, other

    cs.LG cs.AI

    Linear Mode Connectivity in Sparse Neural Networks

    Authors: Luke McDermott, Daniel Cummings

    Abstract: With the rise in interest of sparse neural networks, we study how neural network pruning with synthetic data leads to sparse networks with unique training properties. We find that distilled data, a synthetic summarization of the real data, paired with Iterative Magnitude Pruning (IMP) unveils a new class of sparse networks that are more stable to SGD noise on the real data, than either the dense m… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Published in NeurIPS 2023 UniReps Workshop

  5. arXiv:2310.16856  [pdf, other

    cs.CV

    GraFT: Gradual Fusion Transformer for Multimodal Re-Identification

    Authors: Haoli Yin, Jiayao Li, Eva Schiller, Luke McDermott, Daniel Cummings

    Abstract: Object Re-Identification (ReID) is pivotal in computer vision, witnessing an escalating demand for adept multimodal representation learning. Current models, although promising, reveal scalability limitations with increasing modalities as they rely heavily on late fusion, which postpones the integration of specific modality insights. Addressing this, we introduce the \textbf{Gradual Fusion Transfor… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 3 Borderline Reviews at WACV, 8 pages, 5 figures, 8 tables

  6. arXiv:2307.03364  [pdf, other

    cs.LG

    Distilled Pruning: Using Synthetic Data to Win the Lottery

    Authors: Luke McDermott, Daniel Cummings

    Abstract: This work introduces a novel approach to pruning deep learning models by using distilled data. Unlike conventional strategies which primarily focus on architectural or algorithmic optimization, our method reconsiders the role of data in these scenarios. Distilled datasets capture essential patterns from larger datasets, and we demonstrate how to leverage this capability to enable a computationally… ▽ More

    Submitted 8 August, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

  7. arXiv:2303.08774  [pdf, other

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo… ▽ More

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  8. arXiv:2205.10358  [pdf, other

    cs.LG cs.NE

    A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities

    Authors: Daniel Cummings, Anthony Sarah, Sharath Nittur Sridhar, Maciej Szankin, Juan Pablo Munoz, Sairam Sundaresan

    Abstract: Recent advances in Neural Architecture Search (NAS) such as one-shot NAS offer the ability to extract specialized hardware-aware sub-network configurations from a task-specific super-network. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still under-explored. Popula… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  9. arXiv:2202.12954  [pdf, other

    cs.AI

    A Hardware-Aware System for Accelerating Deep Neural Network Optimization

    Authors: Anthony Sarah, Daniel Cummings, Sharath Nittur Sridhar, Sairam Sundaresan, Maciej Szankin, Tristan Webb, J. Pablo Munoz

    Abstract: Recent advances in Neural Architecture Search (NAS) which extract specialized hardware-aware configurations (a.k.a. "sub-networks") from a hardware-agnostic "super-network" have become increasingly popular. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still largely… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  10. arXiv:2202.12934  [pdf, other

    cs.NE

    Accelerating Neural Architecture Exploration Across Modalities Using Genetic Algorithms

    Authors: Daniel Cummings, Sharath Nittur Sridhar, Anthony Sarah, Maciej Szankin

    Abstract: Neural architecture search (NAS), the study of automating the discovery of optimal deep neural network architectures for tasks in domains such as computer vision and natural language processing, has seen rapid growth in the machine learning research community. While there have been many recent advancements in NAS, there is still a significant focus on reducing the computational cost incurred when… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  11. arXiv:2107.03374  [pdf, other

    cs.LG

    Evaluating Large Language Models Trained on Code

    Authors: Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter , et al. (33 additional authors not shown)

    Abstract: We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J sol… ▽ More

    Submitted 14 July, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: corrected typos, added references, added authors, added acknowledgements

  12. Structured Citation Trend Prediction Using Graph Neural Networks

    Authors: Daniel Cummings, Marcel Nassar

    Abstract: Academic citation graphs represent citation relationships between publications across the full range of academic fields. Top cited papers typically reveal future trends in their corresponding domains which is of importance to both researchers and practitioners. Prior citation prediction methods often require initial citation trends to be established and do not take advantage of the recent advancem… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: Appeared in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020. 5 pages, 5 figures