Skip to main content

Showing 1–28 of 28 results for author: Elidan, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12878  [pdf, other

    cs.CL cs.AI

    Do LLMs have Consistent Values?

    Authors: Naama Rozen, Gal Elidan, Amir Globerson, Ella Daniel

    Abstract: Values are a basic driving force underlying human behavior. Large Language Models (LLM) technology is constantly improving towards human-like dialogue. However, little research has been done to study the values exhibited in text generated by LLMs. Here we study this question by turning to the rich literature on value structure in psychology. We ask whether LLMs exhibit the same value structure tha… ▽ More

    Submitted 19 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures, and there are more in the appendix

  2. arXiv:2407.12687  [pdf, other

    cs.CY cs.AI cs.LG

    Towards Responsible Development of Generative AI for Education: An Evaluation-Driven Approach

    Authors: Irina Jurenka, Markus Kunesch, Kevin R. McKee, Daniel Gillick, Shaojian Zhu, Sara Wiltberger, Shubham Milind Phal, Katherine Hermann, Daniel Kasenberg, Avishkar Bhoopchand, Ankit Anand, Miruna Pîslar, Stephanie Chan, Lisa Wang, Jennifer She, Parsa Mahmoudieh, Aliya Rysbek, Wei-Jen Ko, Andrea Huber, Brett Wiltshire, Gal Elidan, Roni Rabin, Jasmin Rubinovitz, Amit Pitaru, Mac McAllister , et al. (49 additional authors not shown)

    Abstract: A major challenge facing the world is the provision of equitable and universal access to quality education. Recent advances in generative AI (gen AI) have created excitement about the potential of new technologies to offer a personal tutor for every learner and a teaching assistant for every teacher. The full extent of this dream, however, has not yet materialised. We argue that this is primarily… ▽ More

    Submitted 19 July, 2024; v1 submitted 21 May, 2024; originally announced July 2024.

  3. arXiv:2406.03618  [pdf, other

    cs.CL

    TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools

    Authors: Avi Caciularu, Alon Jacovi, Eyal Ben-David, Sasha Goldshtein, Tal Schuster, Jonathan Herzig, Gal Elidan, Amir Globerson

    Abstract: Large Language Models (LLMs) often do not perform well on queries that require the aggregation of information across texts. To better evaluate this setting and facilitate modeling efforts, we introduce TACT - Text And Calculations through Tables, a dataset crafted to evaluate LLMs' reasoning and computational abilities using complex instructions. TACT contains challenging instructions that demand… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Website (https://tact-benchmark.github.io), Huggingface (https://huggingface.co/datasets/google/TACT)

  4. arXiv:2307.03319  [pdf, other

    cs.CL

    Covering Uncommon Ground: Gap-Focused Question Generation for Answer Assessment

    Authors: Roni Rabin, Alexandre Djerbetian, Roee Engelberg, Lidan Hackmon, Gal Elidan, Reut Tsarfaty, Amir Globerson

    Abstract: Human communication often involves information gaps between the interlocutors. For example, in an educational dialogue, a student often provides an answer that is incomplete, and there is a gap between this answer and the perfect one expected by the teacher. Successful dialogue then hinges on the teacher asking about this gap in an effective manner, thus creating a rich and interactive educational… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  5. arXiv:2306.00186  [pdf, other

    cs.CL

    Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback

    Authors: Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor

    Abstract: Despite the seeming success of contemporary grounded text generation systems, they often tend to generate factually inconsistent text with respect to their input. This phenomenon is emphasized in tasks like summarization, in which the generated summaries should be corroborated by their source article. In this work, we leverage recent progress on textual entailment models to directly address this p… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: ACL 2023

  6. arXiv:2208.02294  [pdf, other

    cs.CL cs.LG

    Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning

    Authors: Deborah Cohen, Moonkyung Ryu, Yinlam Chow, Orgad Keller, Ido Greenberg, Avinatan Hassidim, Michael Fink, Yossi Matias, Idan Szpektor, Craig Boutilier, Gal Elidan

    Abstract: Despite recent advances in natural language understanding and generation, and decades of research on the development of conversational bots, building automated agents that can carry on rich open-ended conversations with humans "in the wild" remains a formidable challenge. In this work we develop a real-time, open-ended dialogue system that uses reinforcement learning (RL) to power a bot's conversa… ▽ More

    Submitted 25 July, 2022; originally announced August 2022.

  7. arXiv:2204.04670  [pdf, other

    cs.LG

    Active Learning with Label Comparisons

    Authors: Gal Yona, Shay Moran, Gal Elidan, Amir Globerson

    Abstract: Supervised learning typically relies on manual annotation of the true labels. When there are many potential classes, searching for the best one can be prohibitive for a human annotator. On the other hand, comparing two candidate labels is often much easier. We focus on this type of pairwise supervision and ask how it can be used effectively in learning, and in particular in active learning. We obt… ▽ More

    Submitted 14 August, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: Appeared in the conference on Uncertainty in AI (UAI), 2022

  8. arXiv:2111.02780  [pdf

    cs.LG

    Flood forecasting with machine learning models in an operational framework

    Authors: Sella Nevo, Efrat Morin, Adi Gerzi Rosenthal, Asher Metzger, Chen Barshai, Dana Weitzner, Dafi Voloshin, Frederik Kratzert, Gal Elidan, Gideon Dror, Gregory Begelman, Grey Nearing, Guy Shalev, Hila Noga, Ira Shavitt, Liora Yuklea, Moriah Royz, Niv Giladi, Nofar Peled Levi, Ofir Reich, Oren Gilon, Ronnie Maor, Shahar Timnat, Tal Shechter, Vladimir Anisimov , et al. (6 additional authors not shown)

    Abstract: The operational flood forecasting system by Google was developed to provide accurate real-time flood warnings to agencies and the public, with a focus on riverine floods in large, gauged rivers. It became operational in 2018 and has since expanded geographically. This forecasting system consists of four subsystems: data validation, stage forecasting, inundation modeling, and alert distribution. Ma… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: 36 pages, 10 figures, 3 tables, 1 supplementary table (9 pages)

  9. arXiv:2105.01904  [pdf, other

    cs.LG cs.AI

    Solving Sokoban with forward-backward reinforcement learning

    Authors: Yaron Shoham, Gal Elidan

    Abstract: Despite seminal advances in reinforcement learning in recent years, many domains where the rewards are sparse, e.g. given only at task completion, remain quite challenging. In such cases, it can be beneficial to tackle the task both from its beginning and end, and make the two ends meet. Existing approaches that do so, however, are not effective in the common scenario where the strategy needed nea… ▽ More

    Submitted 22 May, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: To be published in SoCS 2021

    ACM Class: I.2.6; I.2.8

  10. arXiv:2104.13369  [pdf, other

    cs.CV cs.LG cs.NE eess.IV stat.ML

    Explaining in Style: Training a GAN to explain a classifier in StyleSpace

    Authors: Oran Lang, Yossi Gandelsman, Michal Yarom, Yoav Wald, Gal Elidan, Avinatan Hassidim, William T. Freeman, Phillip Isola, Amir Globerson, Michal Irani, Inbar Mosseri

    Abstract: Image classification models can depend on multiple different semantic attributes of the image. An explanation of the decision of the classifier needs to both discover and visualize these properties. Here we present StylEx, a method for doing this, by training a generative model to specifically explain multiple attributes that underlie classifier decisions. A natural source for such attributes is t… ▽ More

    Submitted 1 September, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted to ICCV 2021. Project page: https://explaining-in-style.github.io/, Code: https://github.com/google/explaining-in-style

  11. arXiv:2012.00671  [pdf, other

    physics.ao-ph cs.LG

    ML-based Flood Forecasting: Advances in Scale, Accuracy and Reach

    Authors: Sella Nevo, Gal Elidan, Avinatan Hassidim, Guy Shalev, Oren Gilon, Grey Nearing, Yossi Matias

    Abstract: Floods are among the most common and deadly natural disasters in the world, and flood warning systems have been shown to be effective in reducing harm. Yet the majority of the world's vulnerable population does not have access to reliable and actionable warning systems, due to core challenges in scalability, computational costs, and data availability. In this paper we present two components of flo… ▽ More

    Submitted 5 December, 2020; v1 submitted 29 November, 2020; originally announced December 2020.

    Comments: Submitted/accepted to NeurIPS HADR workshop: https://www.hadr.ai/home

  12. arXiv:2007.00595  [pdf, other

    cs.LG stat.ML

    HydroNets: Leveraging River Structure for Hydrologic Modeling

    Authors: Zach Moshe, Asher Metzger, Gal Elidan, Frederik Kratzert, Sella Nevo, Ran El-Yaniv

    Abstract: Accurate and scalable hydrologic models are essential building blocks of several important applications, from water resource management to timely flood warnings. However, as the climate changes, precipitation and rainfall-runoff pattern variations become more extreme, and accurate training data that can account for the resulting distributional shifts become more scarce. In this work we present a n… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: Presented in the "AI for physical sciences" workshop, ICLR2020 (https://ai4earthscience.github.io/iclr-2020-workshop/)

  13. arXiv:2006.06465  [pdf, other

    cs.LG stat.ML

    DNF-Net: A Neural Architecture for Tabular Data

    Authors: Ami Abutbul, Gal Elidan, Liran Katzir, Ran El-Yaniv

    Abstract: A challenging open question in deep learning is how to handle tabular data. Unlike domains such as image and natural language processing, where deep architectures prevail, there is still no widely accepted neural architecture that dominates tabular data. As a step toward bridging this gap, we present DNF-Net a novel generic architecture whose inductive bias elicits models whose structure correspon… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

  14. arXiv:2004.10255  [pdf, other

    stat.ML cs.LG eess.SP

    Convex Nonparanormal Regression

    Authors: Yonatan Woodbridge, Gal Elidan, Ami Wiesel

    Abstract: Quantifying uncertainty in predictions or, more generally, estimating the posterior conditional distribution, is a core challenge in machine learning and statistics. We introduce Convex Nonparanormal Regression (CNR), a conditional nonparanormal approach for coping with this task. CNR involves a convex optimization of a posterior defined via a rich dictionary of pre-defined non linear transformati… ▽ More

    Submitted 4 April, 2021; v1 submitted 21 April, 2020; originally announced April 2020.

  15. arXiv:1911.00870  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    MadNet: Using a MAD Optimization for Defending Against Adversarial Attacks

    Authors: Shai Rozenberg, Gal Elidan, Ran El-Yaniv

    Abstract: This paper is concerned with the defense of deep models against adversarial attacks. Inspired by the certificate defense approach, we propose a maximal adversarial distortion (MAD) optimization method for robustifying deep networks. MAD captures the idea of increasing separability of class clusters in the embedding space while decreasing the network sensitivity to small distortions. Given a deep n… ▽ More

    Submitted 12 June, 2020; v1 submitted 3 November, 2019; originally announced November 2019.

  16. arXiv:1910.12204  [pdf, other

    cs.LG stat.ML

    Spectral Algorithm for Low-rank Multitask Regression

    Authors: Yotam Gigi, Ami Wiesel, Sella Nevo, Gal Elidan, Avinatan Hassidim, Yossi Matias

    Abstract: Multitask learning, i.e. taking advantage of the relatedness of individual tasks in order to improve performance on all of them, is a core challenge in the field of machine learning. We focus on matrix regression tasks where the rank of the weight matrix is constrained to reduce sample complexity. We introduce the common mechanism regression (CMR) model which assumes a shared left low-rank compone… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

  17. arXiv:1901.09583  [pdf, other

    cs.LG stat.ML

    ML for Flood Forecasting at Scale

    Authors: Sella Nevo, Vova Anisimov, Gal Elidan, Ran El-Yaniv, Pete Giencke, Yotam Gigi, Avinatan Hassidim, Zach Moshe, Mor Schlesinger, Guy Shalev, Ajai Tirumali, Ami Wiesel, Oleg Zlydenko, Yossi Matias

    Abstract: Effective riverine flood forecasting at scale is hindered by a multitude of factors, most notably the need to rely on human calibration in current methodology, the limited amount of data for a specific location, and the computational difficulty of building continent/global level models that are sufficiently accurate. Machine learning (ML) is primed to be useful in this scenario: learned models oft… ▽ More

    Submitted 28 January, 2019; originally announced January 2019.

    Comments: The 2-page paper sent to NeurIPS 2018 AI for social good workshop

  18. arXiv:1901.00786  [pdf, other

    cs.LG stat.ML

    Towards Global Remote Discharge Estimation: Using the Few to Estimate The Many

    Authors: Yotam Gigi, Gal Elidan, Avinatan Hassidim, Yossi Matias, Zach Moshe, Sella Nevo, Guy Shalev, Ami Wiesel

    Abstract: Learning hydrologic models for accurate riverine flood prediction at scale is a challenge of great importance. One of the key difficulties is the need to rely on in-situ river discharge measurements, which can be quite scarce and unreliable, particularly in regions where floods cause the most damage every year. Accordingly, in this work we tackle the problem of river discharge estimation at differ… ▽ More

    Submitted 3 January, 2019; originally announced January 2019.

    Comments: The 4-page paper sent to NeurIPS 2018 AI for social good workshop

  19. arXiv:1803.03155  [pdf, other

    cs.LG

    Learning Rules-First Classifiers

    Authors: Deborah Cohen, Amit Daniely, Amir Globerson, Gal Elidan

    Abstract: Complex classifiers may exhibit "embarassing" failures in cases where humans can easily provide a justified classification. Avoiding such failures is obviously of key importance. In this work, we focus on one such setting, where a label is perfectly predictable if the input contains certain features, or rules, and otherwise it is predictable by a linear classifier. We define a hypothesis class tha… ▽ More

    Submitted 13 June, 2019; v1 submitted 8 March, 2018; originally announced March 2018.

  20. arXiv:1608.04802  [pdf, other

    stat.ML cs.LG

    Scalable Learning of Non-Decomposable Objectives

    Authors: Elad ET. Eban, Mariano Schain, Alan Mackey, Ariel Gordon, Rif A. Saurous, Gal Elidan

    Abstract: Modern retrieval systems are often driven by an underlying machine learning model. The goal of such systems is to identify and possibly rank the few most relevant items for a given query or context. Thus, such systems are typically evaluated using a ranking-based performance metric such as the area under the precision-recall curve, the $F_β$ score, precision at fixed recall, etc. Obviously, it is… ▽ More

    Submitted 1 March, 2017; v1 submitted 16 August, 2016; originally announced August 2016.

  21. arXiv:1309.6867  [pdf

    cs.LG stat.ME

    Speedy Model Selection (SMS) for Copula Models

    Authors: Yaniv Tenzer, Gal Elidan

    Abstract: We tackle the challenge of efficiently learning the structure of expressive multivariate real-valued densities of copula graphical models. We start by theoretically substantiating the conjecture that for many copula families the magnitude of Spearman's rank correlation coefficient is monotone in the expected contribution of an edge in network, namely the negative copula entropy. We then build on t… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-625-634

  22. arXiv:1309.6847  [pdf

    cs.LG stat.ML

    Learning Max-Margin Tree Predictors

    Authors: Ofer Meshi, Elad Eban, Gal Elidan, Amir Globerson

    Abstract: Structured prediction is a powerful framework for coping with joint prediction of interacting outputs. A central difficulty in using this framework is that often the correct label dependence structure is unknown. At the same time, we would like to avoid an overly complex structure that will lead to intractable prediction. In this work we address the challenge of learning tree structured predictive… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-411-420

  23. arXiv:1301.2269  [pdf

    cs.LG cs.AI stat.ML

    Learning the Dimensionality of Hidden Variables

    Authors: Gal Elidan, Nir Friedman

    Abstract: A serious problem in learning probabilistic models is the presence of hidden variables. These variables are not observed, yet interact with several of the observed variables. Detecting hidden variables poses two problems: determining the relations to other variables in the model and determining the number of states of the hidden variable. In this paper, we address the latter problem in the context… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-144-151

  24. arXiv:1212.2460  [pdf

    cs.LG stat.ML

    The Information Bottleneck EM Algorithm

    Authors: Gal Elidan, Nir Friedman

    Abstract: Learning with hidden variables is a central challenge in probabilistic graphical models that has important implications for many real-life problems. The classical approach is using the Expectation Maximization (EM) algorithm. This algorithm, however, can get trapped in local maxima. In this paper we explore a new approach that is based on the Information Bottleneck principle. In this approach, we… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-200-208

  25. arXiv:1207.4133  [pdf

    cs.LG stat.ML

    "Ideal Parent" Structure Learning for Continuous Variable Networks

    Authors: Iftach Nachman, Gal Elidan, Nir Friedman

    Abstract: In recent years, there is a growing interest in learning Bayesian networks with continuous variables. Learning the structure of such networks is a computationally expensive procedure, which limits most applications to parameter learning. This problem is even more acute when learning networks with hidden variables. We present a general method for significantly speeding the structure search algorith… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-400-409

  26. arXiv:1206.6837  [pdf

    cs.AI

    Residual Belief Propagation: Informed Scheduling for Asynchronous Message Passing

    Authors: Gal Elidan, Ian McGraw, Daphne Koller

    Abstract: Inference for probabilistic graphical models is still very much a practical challenge in large domains. The commonly used and effective belief propagation (BP) algorithm and its generalizations often do not converge when applied to hard, real-life inference tasks. While it is widely recognized that the scheduling of messages in these algorithms may have significant consequences, this issue remains… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-165-173

  27. arXiv:1206.3252  [pdf

    cs.LG stat.ML

    Convex Point Estimation using Undirected Bayesian Transfer Hierarchies

    Authors: Gal Elidan, Ben Packer, Geremy Heitz, Daphne Koller

    Abstract: When related learning tasks are naturally arranged in a hierarchy, an appealing approach for coping with scarcity of instances is that of transfer learning using a hierarchical Bayes framework. As fully Bayesian computations can be difficult and computationally demanding, it is often desirable to use posterior point estimates that facilitate (relatively) efficient prediction. However, the hierarch… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-179-187

  28. arXiv:1203.3476  [pdf

    cs.LG stat.ML

    Inference-less Density Estimation using Copula Bayesian Networks

    Authors: Gal Elidan

    Abstract: We consider learning continuous probabilistic graphical models in the face of missing data. For non-Gaussian models, learning the parameters and structure of such models depends on our ability to perform efficient inference, and can be prohibitive even for relatively modest domains. Recently, we introduced the Copula Bayesian Network (CBN) density model - a flexible framework that captures complex… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-151-159