José Angel Daza

NLP Engineer

Randstad
550 followers, 500+ connections

About

I am a computer scientist mainly interested in Natural Language Processing, Artificial Intelligence, and software development. I have worked as a technology developer in industry and have also developed my own ideas. As a researcher, I have worked on novel methods for processing multilingual data, and I would be happy to collaborate on more projects that explore and apply algorithms in the NLP and AI fields.

Experience

  • Research Software Engineer

    Netherlands eScience Center

    - present, 1 year 1 month

    Amsterdam, North Holland, Netherlands

  • NLP Researcher

    Vrije Universiteit Amsterdam (VU Amsterdam)

    - 2 years 10 months

    Amsterdam, North Holland, Netherlands

  • NLP Software Developer

    Leibniz-Institut für Deutsche Sprache

    - 7 months

    Mannheim, Baden-Württemberg, Germany

  • Researcher, PhD Candidate

    Heidelberg University

    - 3 years 5 months

    Heidelberg Area, Germany

    Thesis: Cross-lingual Semantic Role Labeling through Translation and Multilingual Learning

  • Visiting Researcher

    RIKEN

    - 4 months

    Tokyo, Japan

    Member of the Language Information Access Technology team.

  • Senior Developer

    Innomius Technologies

    - 1 year 3 months

    Mexico City Area, Mexico

    Implementation of Tableau projects, machine learning, sentiment analysis and NLP software development.

  • Mobile & Web Developer

    Freelance

    - 5 years 3 months

    Mexico City Area, Mexico

    Mobile App prototype design and development. Code optimization and user interface improvements. Back-end and front-end maintenance.

  • Cofounder & Prototype Engineer

    VAWM Co. (Breaking Tech)

    - 1 year 1 month

  • Assistant

    Management Solutions

    - 9 months

    Software for automatic generation of executive reports. Use of risk management databases. Improvements to the performance and documentation of a credit-card system.

  • Innovation & New Technologies Analyst

    Harweb

    - 1 year

    Research on new technologies and their feasibility for integration with the existing software. Web-based programming: JavaScript, jQuery plugin integration, and some C# programming on the server side. Also in charge of the iPad mobile app built with Appcelerator Titanium. Design and implementation of web services.

Publications

  • X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Corpus

    EMNLP 2020

    Even though SRL is researched for many languages, major improvements have mostly been obtained for English, for which more resources are available. In fact, existing multilingual SRL datasets contain disparate annotation styles or come from different domains, hampering generalization in multilingual learning. In this work we propose a method to automatically construct an SRL corpus that is parallel in four languages: English, French, German, Spanish, with unified predicate and role annotations that are fully comparable across languages. We apply high-quality machine translation to the English CoNLL-09 dataset and use multilingual BERT to project its high-quality annotations to the target languages. We include human-validated test sets that we use to measure the projection quality, and show that projection is denser and more precise than a strong baseline. Finally, we train different SOTA models on our novel corpus for mono- and multilingual SRL, showing that the multi-lingual annotations improve performance especially for the weaker languages.

  • Translate and Label! An Encoder-Decoder Approach for Cross-lingual Semantic Role Labeling

    Conference on Empirical Methods in Natural Language Processing - EMNLP-IJCNLP 2019

  • A Sequence-to-Sequence Model for Semantic Role Labeling

    Proceedings of The Third Workshop on Representation Learning for NLP. ACL 2018

    We explore a novel approach for Semantic Role Labeling (SRL) by casting it as a sequence-to-sequence process. We employ an attention-based model enriched with a copying mechanism to ensure faithful regeneration of the input sequence, while enabling interleaved generation of argument role labels. Here, we apply this model in a monolingual setting, performing PropBank SRL on English language data. The constrained sequence generation set-up enforced with the copying mechanism allows us to analyze the performance and special properties of the model on manually labeled data and benchmarking against state-of-the-art sequence labeling models. We show that our model is able to solve the SRL argument labeling task on English data, yet further structural decoding constraints will need to be added to make the model truly competitive. Our work represents a first step towards more advanced, generative SRL labeling setups.

  • Automatic Story Generation by Learning from Literary Structures

    Scholars' Press

    Are mind and machine capable of solving the same tasks? Creativity is one of the arguments that some philosophers and psychologists use as a proof of what computers cannot achieve; however, these arguments might be based on a misconception of what both intelligence and creativity mean. This book provides arguments supporting that creativity, as storytelling, can be emulated through computer programs. The assumption of creativity presents a major problem: Complexity. Even if we consider creativity just as a product of novel ways of achieving a goal, the number of combinations found when dealing with the ‘real world’ is astronomically huge. We can recall The Library of Babel (Borges, 1944), a library that contains any possible book that could be written in the history of humanity. This metaphor reveals the combinatory problem that emerges if a brute force algorithm is designed to generate texts. According to our hypothesis, our proposal is a heuristic that uses simple syntactic and semantic properties found in a text corpus in order to generate novel and coherent fiction texts based on what has been already written.

  • Automatic Text Generation by Learning from Literary Structures

    Proceedings of the Fifth Workshop on Computational Linguistics for Literature, NAACL-HLT 2016

    Most of the work dealing with automatic story production is based on a generic architecture for text generation; however, the resulting stories still lack a style that can be called literary. We believe that in order to generate automatically stories that could be compared with those by human authors, a specific methodology for fiction text generation should be defined. We also believe that it is essential for a story to convey the effect of originality to the person who is reading it. Our methodology proposes corpus-based generation of stories that could be called creative and also have a style similar to human fiction texts. We also show how these stories have plausible syntax and coherence, and are perceived as interesting by human evaluators.

Courses

  • Algorithm Analysis and Design

  • Artificial Intelligence

  • Computer Systems Design

  • Discrete Mathematics

  • Natural Language Generation

  • Natural Language Processing

  • Pattern Recognition

  • Statistical Processing of Textual Information

  • Theory of Computation

Projects

  • CuantoCobrar

    - present

    iOS App that calculates the price of freelancing projects.

  • InTaVia: Visual Analysis, Curation & Communication for In/Tangible European Heritage

    The InTaVia knowledge graph contains data on Europe’s cultural history, including data on individual artists, cultural objects, and groups or organizations. Search for these entities in our knowledge base (with a focus on Slovenia, Austria, the Netherlands and Finland) or with a global reach via data from Wikipedia. You can also upload your own data, and curate (i.e., edit, assemble, or enrich) all kinds of data for further operations of visual analysis and narration.

Test scores

  • TOEFL-iBT

    Score: 101

Languages

  • Spanish

    Native or bilingual proficiency

  • English

    Full professional proficiency

  • German

    Elementary proficiency

  • Portuguese

    Limited working proficiency
