Florian Metze

Person information

affiliation: Carnegie Mellon University, Pittsburgh, USA

2020 – today

2024
[c215]
- dblp key: conf/icassp/MichaelsLYYWM24
- persistent URL: https://dblp.org/rec/conf/icassp/MichaelsLYYWM24
Jackson Michaels, Juncheng B. Li, Laura Yao, Lijun Yu, Zach Wood-Doughty, Florian Metze:
Audio-Journey: Open Domain Latent Diffusion Based Text-To-Audio Generation. ICASSP 2024: 6960-6964
2023
[j16]
- dblp key: journals/taslp/DalmiaOLEWMZM23
- persistent URL: https://dblp.org/rec/journals/taslp/DalmiaOLEWMZM23
Siddharth Dalmia, Dmytro Okhonko, Mike Lewis, Sergey Edunov, Shinji Watanabe, Florian Metze, Luke Zettlemoyer, Abdelrahman Mohamed:
LegoNN: Building Modular Encoder-Decoder Models. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3112-3126 (2023)
[c214]
- dblp key: conf/eacl/YanDHNMBW23
- persistent URL: https://dblp.org/rec/conf/eacl/YanDHNMBW23
Brian Yan, Siddharth Dalmia, Yosuke Higuchi, Graham Neubig, Florian Metze, Alan W. Black, Shinji Watanabe:
CTC Alignments Improve Autoregressive Translation. EACL 2023: 1615-1631
2022
[c213]
- dblp key: conf/acl/LiMM0B22
- persistent URL: https://dblp.org/rec/conf/acl/LiMM0B22
Xinjian Li, Florian Metze, David R. Mortensen, Shinji Watanabe, Alan W. Black:
Zero-shot Learning for Grapheme to Phoneme Conversion with Language Ensemble. ACL (Findings) 2022: 2106-2115
[c212]
- dblp key: conf/cvpr/AfourasAFVM22
- persistent URL: https://dblp.org/rec/conf/cvpr/AfourasAFVM22
Triantafyllos Afouras, Yuki M. Asano, Francois Fagan, Andrea Vedaldi, Florian Metze:
Self-supervised object detection from audio-visual correspondence. CVPR 2022: 10565-10576
[c211]
- dblp key: conf/emnlp/ParkAMXMKA22
- persistent URL: https://dblp.org/rec/conf/emnlp/ParkAMXMKA22
Yookoon Park, Mahmoud Azab, Seungwhan Moon, Bo Xiong, Florian Metze, Gourab Kundu, Kirmani Ahmed:
Normalized Contrastive Learning for Text-Video Retrieval. EMNLP 2022: 248-260
[c210]
- dblp key: conf/emnlp/PalaskarBBMBM22
- persistent URL: https://dblp.org/rec/conf/emnlp/PalaskarBBMBM22
Shruti Palaskar, Akshita Bhagia, Yonatan Bisk, Florian Metze, Alan W. Black, Ana Marasovic:
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization. EMNLP (Findings) 2022: 2644-2657
[c209]
- dblp key: conf/emnlp/AroraDYMB022
- persistent URL: https://dblp.org/rec/conf/emnlp/AroraDYMB022
Siddhant Arora, Siddharth Dalmia, Brian Yan, Florian Metze, Alan W. Black, Shinji Watanabe:
Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models. EMNLP (Findings) 2022: 5419-5429
[c208]
- dblp key: conf/icassp/LiQLHM22
- persistent URL: https://dblp.org/rec/conf/icassp/LiQLHM22
Juncheng B. Li, Shuhui Qu, Xinjian Li, Bernie Po-Yao Huang, Florian Metze:
On Adversarial Robustness Of Large-Scale Audio Visual Learning. ICASSP 2022: 231-235
[c207]
- dblp key: conf/icassp/SharmaPBM22
- persistent URL: https://dblp.org/rec/conf/icassp/SharmaPBM22
Roshan Sharma, Shruti Palaskar, Alan W. Black, Florian Metze:
End-to-End Speech Summarization Using Restricted Self-Attention. ICASSP 2022: 8072-8076
[c206]
- dblp key: conf/interspeech/0001Q0M22
- persistent URL: https://dblp.org/rec/conf/interspeech/0001Q0M22
Juncheng Li, Shuhui Qu, Po-Yao Huang, Florian Metze:
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification. INTERSPEECH 2022: 1521-1525
[c205]
- dblp key: conf/interspeech/LiMMB022
- persistent URL: https://dblp.org/rec/conf/interspeech/LiMMB022
Xinjian Li, Florian Metze, David R. Mortensen, Alan W. Black, Shinji Watanabe:
ASR2K: Speech Recognition for Around 2000 Languages without Audio. INTERSPEECH 2022: 4885-4889
[c204]
- dblp key: conf/lrec/LiMMB022
- persistent URL: https://dblp.org/rec/conf/lrec/LiMMB022
Xinjian Li, Florian Metze, David R. Mortensen, Alan W. Black, Shinji Watanabe:
Phone Inventories and Recognition for Every Language. LREC 2022: 1061-1067
[c203]
- dblp key: conf/nips/000100BAGMF22
- persistent URL: https://dblp.org/rec/conf/nips/000100BAGMF22
Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer:
Masked Autoencoders that Listen. NeurIPS 2022
[i78]
- dblp key: journals/corr/abs-2203-12122
- persistent URL: https://dblp.org/rec/journals/corr/abs-2203-12122
Juncheng B. Li, Shuhui Qu, Xinjian Li, Po-Yao Huang, Florian Metze:
On Adversarial Robustness of Large-scale Audio Visual Learning. CoRR abs/2203.12122 (2022)
[i77]
- dblp key: journals/corr/abs-2203-13448
- persistent URL: https://dblp.org/rec/journals/corr/abs-2203-13448
Juncheng B. Li, Shuhui Qu, Po-Yao Huang, Florian Metze:
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification. CoRR abs/2203.13448 (2022)
[i76]
- dblp key: journals/corr/abs-2205-03268
- persistent URL: https://dblp.org/rec/journals/corr/abs-2205-03268
Juncheng B. Li, Shuhui Qu, Florian Metze:
Robustness of Neural Architectures for Audio Event Detection. CoRR abs/2205.03268 (2022)
[i75]
- dblp key: journals/corr/abs-2205-11686
- persistent URL: https://dblp.org/rec/journals/corr/abs-2205-11686
Shruti Palaskar, Akshita Bhagia, Yonatan Bisk, Florian Metze, Alan W. Black, Ana Marasovic:
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization. CoRR abs/2205.11686 (2022)
[i74]
- dblp key: journals/corr/abs-2206-03318
- persistent URL: https://dblp.org/rec/journals/corr/abs-2206-03318
Siddharth Dalmia, Dmytro Okhonko, Mike Lewis, Sergey Edunov, Shinji Watanabe, Florian Metze, Luke Zettlemoyer, Abdelrahman Mohamed:
LegoNN: Building Modular Encoder-Decoder Models. CoRR abs/2206.03318 (2022)
[i73]
- dblp key: journals/corr/abs-2207-06405
- persistent URL: https://dblp.org/rec/journals/corr/abs-2207-06405
Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer:
Masked Autoencoders that Listen. CoRR abs/2207.06405 (2022)
[i72]
- dblp key: journals/corr/abs-2209-02842
- persistent URL: https://dblp.org/rec/journals/corr/abs-2209-02842
Xinjian Li, Florian Metze, David R. Mortensen, Alan W. Black, Shinji Watanabe:
ASR2K: Speech Recognition for Around 2000 Languages without Audio. CoRR abs/2209.02842 (2022)
[i71]
- dblp key: journals/corr/abs-2210-05200
- persistent URL: https://dblp.org/rec/journals/corr/abs-2210-05200
Brian Yan, Siddharth Dalmia, Yosuke Higuchi, Graham Neubig, Florian Metze, Alan W. Black, Shinji Watanabe:
CTC Alignments Improve Autoregressive Translation. CoRR abs/2210.05200 (2022)
[i70]
- dblp key: journals/corr/abs-2210-07171
- persistent URL: https://dblp.org/rec/journals/corr/abs-2210-07171
Zheng Wang, Juncheng B. Li, Shuhui Qu, Florian Metze, Emma Strubell:
SQuAT: Sharpness- and Quantization-Aware Training for BERT. CoRR abs/2210.07171 (2022)
[i69]
- dblp key: journals/corr/abs-2210-15734
- persistent URL: https://dblp.org/rec/journals/corr/abs-2210-15734
Siddhant Arora, Siddharth Dalmia, Brian Yan, Florian Metze, Alan W. Black, Shinji Watanabe:
Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models. CoRR abs/2210.15734 (2022)
[i68]
- dblp key: journals/corr/abs-2212-05603
- persistent URL: https://dblp.org/rec/journals/corr/abs-2212-05603
Zheng Wang, Juncheng B. Li, Shuhui Qu, Florian Metze, Emma Strubell:
Error-aware Quantization through Noise Tempering. CoRR abs/2212.05603 (2022)
[i67]
- dblp key: journals/corr/abs-2212-11790
- persistent URL: https://dblp.org/rec/journals/corr/abs-2212-11790
Yookoon Park, Mahmoud Azab, Bo Xiong, Seungwhan Moon, Florian Metze, Gourab Kundu, Kirmani Ahmed:
Normalized Contrastive Learning for Text-Video Retrieval. CoRR abs/2212.11790 (2022)
2021
[c202]
- dblp key: conf/acl/XuGHAAFMZ21
- persistent URL: https://dblp.org/rec/conf/acl/XuGHAAFMZ21
Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer:
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding. ACL/IJCNLP (Findings) 2021: 4227-4239
[c201]
- dblp key: conf/cvpr/DuartePVGDMTG21
- persistent URL: https://dblp.org/rec/conf/cvpr/DuartePVGDMTG21
Amanda Cardoso Duarte, Shruti Palaskar, Lucas Ventura, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giró-i-Nieto:
How2Sign: A Large-Scale Multimodal Dataset for Continuous American Sign Language. CVPR 2021: 2735-2744
[c200]
- dblp key: conf/eacl/RavichanderDRMH21
- persistent URL: https://dblp.org/rec/conf/eacl/RavichanderDRMH21
Abhilasha Ravichander, Siddharth Dalmia, Maria Ryskina, Florian Metze, Eduard H. Hovy, Alan W. Black:
NoiseQA: Challenge Set Evaluation for User-Centric Question Answering. EACL 2021: 2976-2992
[c199]
- dblp key: conf/emnlp/XuG0OAMZF21
- persistent URL: https://dblp.org/rec/conf/emnlp/XuG0OAMZF21
Hu Xu, Gargi Ghosh, Po-Yao Huang, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer, Christoph Feichtenhofer:
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding. EMNLP (1) 2021: 6787-6800
[c198]
- dblp key: conf/icassp/LiMQ0M21
- persistent URL: https://dblp.org/rec/conf/icassp/LiMQ0M21
Juncheng B. Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze:
Audio-Visual Event Recognition Through the Lens of Adversary. ICASSP 2021: 616-620
[c197]
- dblp key: conf/icassp/LiMMB21
- persistent URL: https://dblp.org/rec/conf/icassp/LiMMB21
Xinjian Li, David R. Mortensen, Florian Metze, Alan W. Black:
Multilingual Phonetic Dataset for Low Resource Speech Recognition. ICASSP 2021: 6958-6962
[c196]
- dblp key: conf/icassp/Li0YBM21
- persistent URL: https://dblp.org/rec/conf/icassp/Li0YBM21
Xinjian Li, Juncheng Li, Jiali Yao, Alan W. Black, Florian Metze:
Phone Distribution Estimation for Low Resource Languages. ICASSP 2021: 7233-7237
[c195]
- dblp key: conf/iccv/Patrick0MMVAH21
- persistent URL: https://dblp.org/rec/conf/iccv/Patrick0MMVAH21
Mandela Patrick, Po-Yao Huang, Ishan Misra, Florian Metze, Andrea Vedaldi, Yuki M. Asano, João F. Henriques:
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning. ICCV 2021: 10540-10552
[c194]
- dblp key: conf/iclr/Patrick0AMHHV21
- persistent URL: https://dblp.org/rec/conf/iclr/Patrick0AMHHV21
Mandela Patrick, Po-Yao Huang, Yuki Markus Asano, Florian Metze, Alexander G. Hauptmann, João F. Henriques, Andrea Vedaldi:
Support-set bottlenecks for video-text representation learning. ICLR 2021
[c193]
- dblp key: conf/interspeech/PalaskarSBM21
- persistent URL: https://dblp.org/rec/conf/interspeech/PalaskarSBM21
Shruti Palaskar, Ruslan Salakhutdinov, Alan W. Black, Florian Metze:
Multimodal Speech Summarization Through Semantic Concept Learning. Interspeech 2021: 791-795
[c192]
- dblp key: conf/interspeech/AroraO0DM0B21
- persistent URL: https://dblp.org/rec/conf/interspeech/AroraO0DM0B21
Siddhant Arora, Alissa Ostapenko, Vijay Viswanathan, Siddharth Dalmia, Florian Metze, Shinji Watanabe, Alan W. Black:
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding. Interspeech 2021: 1264-1268
[c191]
- dblp key: conf/interspeech/Li0MB21
- persistent URL: https://dblp.org/rec/conf/interspeech/Li0MB21
Xinjian Li, Juncheng Li, Florian Metze, Alan W. Black:
Hierarchical Phone Recognition with Compositional Phonetics. Interspeech 2021: 2461-2465
[c190]
- dblp key: conf/interspeech/YanDMM021
- persistent URL: https://dblp.org/rec/conf/interspeech/YanDMM021
Brian Yan, Siddharth Dalmia, David R. Mortensen, Florian Metze, Shinji Watanabe:
Differentiable Allophone Graphs for Language-Universal Speech Recognition. Interspeech 2021: 2471-2475
[c189]
- dblp key: conf/naacl/DalmiaYRMW21
- persistent URL: https://dblp.org/rec/conf/naacl/DalmiaYRMW21
Siddharth Dalmia, Brian Yan, Vikas Raunak, Florian Metze, Shinji Watanabe:
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks. NAACL-HLT 2021: 1882-1896
[c188]
- dblp key: conf/naacl/HuangPHNMH21
- persistent URL: https://dblp.org/rec/conf/naacl/HuangPHNMH21
Poyao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze, Alex Hauptmann:
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models. NAACL-HLT 2021: 2443-2459
[c187]
- dblp key: conf/nips/PatrickCAMMFVH21
- persistent URL: https://dblp.org/rec/conf/nips/PatrickCAMMFVH21
Mandela Patrick, Dylan Campbell, Yuki M. Asano, Ishan Misra, Florian Metze, Christoph Feichtenhofer, Andrea Vedaldi, João F. Henriques:
Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers. NeurIPS 2021: 12493-12506
[e5]
- dblp key: conf/mm/2021
- persistent URL: https://dblp.org/rec/conf/mm/2021
Heng Tao Shen, Yueting Zhuang, John R. Smith, Yang Yang, Pablo César, Florian Metze, Balakrishnan Prabhakaran:
MM '21: ACM Multimedia Conference, Virtual Event, China, October 20 - 24, 2021. ACM 2021, ISBN 978-1-4503-8651-7 [contents]
[i66]
- dblp key: journals/corr/abs-2102-08345
- persistent URL: https://dblp.org/rec/journals/corr/abs-2102-08345
Abhilasha Ravichander, Siddharth Dalmia, Maria Ryskina, Florian Metze, Eduard H. Hovy, Alan W. Black:
NoiseQA: Challenge Set Evaluation for User-Centric Question Answering. CoRR abs/2102.08345 (2021)
[i65]
- dblp key: journals/corr/abs-2103-08849
- persistent URL: https://dblp.org/rec/journals/corr/abs-2103-08849
Po-Yao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze, Alexander G. Hauptmann:
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models. CoRR abs/2103.08849 (2021)
[i64]
- dblp key: journals/corr/abs-2103-10211
- persistent URL: https://dblp.org/rec/journals/corr/abs-2103-10211
Mandela Patrick, Yuki Markus Asano, Bernie Huang, Ishan Misra, Florian Metze, João F. Henriques, Andrea Vedaldi:
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning. CoRR abs/2103.10211 (2021)
[i63]
- dblp key: journals/corr/abs-2104-06401
- persistent URL: https://dblp.org/rec/journals/corr/abs-2104-06401
Triantafyllos Afouras, Yuki Markus Asano, Francois Fagan, Andrea Vedaldi, Florian Metze:
Self-supervised object detection from audio-visual correspondence. CoRR abs/2104.06401 (2021)
[i62]
- dblp key: journals/corr/abs-2105-00573
- persistent URL: https://dblp.org/rec/journals/corr/abs-2105-00573
Siddharth Dalmia, Brian Yan, Vikas Raunak, Florian Metze, Shinji Watanabe:
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks. CoRR abs/2105.00573 (2021)
[i61]
- dblp key: journals/corr/abs-2105-09996
- persistent URL: https://dblp.org/rec/journals/corr/abs-2105-09996
Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer:
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding. CoRR abs/2105.09996 (2021)
[i60]
- dblp key: journals/corr/abs-2106-05392
- persistent URL: https://dblp.org/rec/journals/corr/abs-2106-05392
Mandela Patrick, Dylan Campbell, Yuki Markus Asano, Ishan Misra, Florian Metze, Christoph Feichtenhofer, Andrea Vedaldi, João F. Henriques:
Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers. CoRR abs/2106.05392 (2021)
[i59]
- dblp key: journals/corr/abs-2106-15065
- persistent URL: https://dblp.org/rec/journals/corr/abs-2106-15065
Siddhant Arora, Alissa Ostapenko, Vijay Viswanathan, Siddharth Dalmia, Florian Metze, Shinji Watanabe, Alan W. Black:
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding. CoRR abs/2106.15065 (2021)
[i58]
- dblp key: journals/corr/abs-2107-11628
- persistent URL: https://dblp.org/rec/journals/corr/abs-2107-11628
Brian Yan, Siddharth Dalmia, David R. Mortensen, Florian Metze, Shinji Watanabe:
Differentiable Allophone Graphs for Language-Universal Speech Recognition. CoRR abs/2107.11628 (2021)
[i57]
- dblp key: journals/corr/abs-2109-14084
- persistent URL: https://dblp.org/rec/journals/corr/abs-2109-14084
Hu Xu, Gargi Ghosh, Po-Yao Huang, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer, Christoph Feichtenhofer:
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding. CoRR abs/2109.14084 (2021)
[i56]
- dblp key: journals/corr/abs-2110-06263
- persistent URL: https://dblp.org/rec/journals/corr/abs-2110-06263
Roshan Sharma, Shruti Palaskar, Alan W. Black, Florian Metze:
Speech Summarization using Restricted Self-Attention. CoRR abs/2110.06263 (2021)
2020
[j15]
- dblp key: journals/csl/PalaskarSM20
- persistent URL: https://dblp.org/rec/journals/csl/PalaskarSM20
Shruti Palaskar, Ramon Sanabria, Florian Metze:
Transfer learning for multimodal dialog. Comput. Speech Lang. 64: 101093 (2020)
[j14]
- dblp key: journals/jstsp/SpeciaBCDEGHLLL20
- persistent URL: https://dblp.org/rec/journals/jstsp/SpeciaBCDEGHLLL20
Lucia Specia, Loïc Barrault, Ozan Caglayan, Amanda Cardoso Duarte, Desmond Elliott, Spandana Gella, Nils Holzenberger, Chiraag Lala, Sun Jae Lee, Jindrich Libovický, Pranava Madhyastha, Florian Metze, Karl Mulligan, Alissa Ostapenko, Shruti Palaskar, Ramon Sanabria, Josiah Wang, Raman Arora:
Grounded Sequence to Sequence Transduction. IEEE J. Sel. Top. Signal Process. 14(3): 577-591 (2020)
[j13]
- dblp key: journals/taslp/ScharenborgOPAC20
- persistent URL: https://dblp.org/rec/journals/taslp/ScharenborgOPAC20
Odette Scharenborg, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux, Laurent Besacier, Alan W. Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller:
Speech Technology for Unwritten Languages. IEEE ACM Trans. Audio Speech Lang. Process. 28: 964-975 (2020)
[j12]
- dblp key: journals/titb/Dong0RBLDDMYS20
- persistent URL: https://dblp.org/rec/journals/titb/Dong0RBLDDMYS20
Fengquan Dong, Kun Qian, Zhao Ren, Alice Baird, Xinjian Li, Zhenyu Dai, Bo Dong, Florian Metze, Yoshiharu Yamamoto, Björn W. Schuller:
Machine Listening for Heart Status Monitoring: Introducing and Benchmarking HSS - The Heart Sounds Shenzhen Corpus. IEEE J. Biomed. Health Informatics 24(7): 2082-2092 (2020)
[c186]
- dblp key: conf/aaai/LiDM0BM20
- persistent URL: https://dblp.org/rec/conf/aaai/LiDM0BM20
Xinjian Li, Siddharth Dalmia, David R. Mortensen, Juncheng Li, Alan W. Black, Florian Metze:
Towards Zero-Shot Learning for Automatic Phonemic Transcription. AAAI 2020: 8261-8268
[c185]
- dblp key: conf/ei-imawm/ZhouEM0W20
- persistent URL: https://dblp.org/rec/conf/ei-imawm/ZhouEM0W20
Zhong Zhou, Isak Czeresnia Etinger, Florian Metze, Alexander Hauptmann, Alexander Waibel:
Gun Source and Muzzle Head Detection. IMAWM 2020: 1-11
[c184]
- dblp key: conf/emnlp/SrinivasanSME20
- persistent URL: https://dblp.org/rec/conf/emnlp/SrinivasanSME20
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott:
Fine-Grained Grounding for Multimodal Speech Recognition. EMNLP (Findings) 2020: 2667-2677
[c183]
- dblp key: conf/emnlp/RaunakD0M20
- persistent URL: https://dblp.org/rec/conf/emnlp/RaunakD0M20
Vikas Raunak, Siddharth Dalmia, Vivek Gupta, Florian Metze:
On Long-Tailed Phenomena in Neural Machine Translation. EMNLP (Findings) 2020: 3088-3095
[c182]
- dblp key: conf/icassp/SrinivasanSM20
- persistent URL: https://dblp.org/rec/conf/icassp/SrinivasanSM20
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Looking Enhances Listening: Recovering Missing Speech Using Images. ICASSP 2020: 6304-6308
[c181]
- dblp key: conf/icassp/ManiPMKM20
- persistent URL: https://dblp.org/rec/conf/icassp/ManiPMKM20
Anirudh Mani, Shruti Palaskar, Nimshi Venkat Meripo, Sandeep Konam, Florian Metze:
ASR Error Correction and Domain Adaptation Using Machine Translation. ICASSP 2020: 6344-6348
[c180]
- dblp key: conf/icassp/LiD0LLYAMNBM20
- persistent URL: https://dblp.org/rec/conf/icassp/LiD0LLYAMNBM20
Xinjian Li, Siddharth Dalmia, Juncheng Li, Matthew Lee, Patrick Littell, Jiali Yao, Antonios Anastasopoulos, David R. Mortensen, Graham Neubig, Alan W. Black, Florian Metze:
Universal Phone Recognition with a Multilingual Allophone System. ICASSP 2020: 8249-8253
[c179]
- dblp key: conf/interspeech/JainKMZMS20
- persistent URL: https://dblp.org/rec/conf/interspeech/JainKMZMS20
Mahaveer Jain, Gil Keren, Jay Mahadeokar, Geoffrey Zweig, Florian Metze, Yatharth Saraf:
Contextual RNN-T for Open Domain ASR. INTERSPEECH 2020: 11-15
[c178]
- dblp key: conf/interspeech/QiuLLMC20
- persistent URL: https://dblp.org/rec/conf/interspeech/QiuLLMC20
Zimeng Qiu, Yiyuan Li, Xinjian Li, Florian Metze, William M. Campbell:
Towards Context-Aware End-to-End Code-Switching Speech Recognition. INTERSPEECH 2020: 4776-4780
[c177]
- dblp key: conf/lrec/MortensenLLMRAB20
- persistent URL: https://dblp.org/rec/conf/lrec/MortensenLLMRAB20
David R. Mortensen, Xinjian Li, Patrick Littell, Alexis Michaud, Shruti Rijhwani, Antonios Anastasopoulos, Alan W. Black, Florian Metze, Graham Neubig:
AlloVera: A Multilingual Allophone Database. LREC 2020: 5329-5336
[c176]
- dblp key: conf/rep4nlp/RaunakKGM20
- persistent URL: https://dblp.org/rec/conf/rep4nlp/RaunakKGM20
Vikas Raunak, Vaibhav Kumar, Vivek Gupta, Florian Metze:
On Dimensional Linguistic Properties of the Word Embedding Space. RepL4NLP@ACL 2020: 156-165
[i55]
- dblp key: journals/corr/abs-2001-11120
- persistent URL: https://dblp.org/rec/journals/corr/abs-2001-11120
Zhong Zhou, Isak Czeresnia Etinger, Florian Metze, Alexander G. Hauptmann, Alexander Waibel:
Gun Source and Muzzle Head Detection. CoRR abs/2001.11120 (2020)
[i54]
- dblp key: journals/corr/abs-2002-05639
- persistent URL: https://dblp.org/rec/journals/corr/abs-2002-05639
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Looking Enhances Listening: Recovering Missing Speech Using Images. CoRR abs/2002.05639 (2020)
[i53]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-11781
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-11781
Xinjian Li, Siddharth Dalmia, David R. Mortensen, Juncheng Li, Alan W. Black, Florian Metze:
Towards Zero-shot Learning for Automatic Phonemic Transcription. CoRR abs/2002.11781 (2020)
[i52]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-11800
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-11800
Xinjian Li, Siddharth Dalmia, Juncheng Li, Matthew Lee, Patrick Littell, Jiali Yao, Antonios Anastasopoulos, David R. Mortensen, Graham Neubig, Alan W. Black, Florian Metze:
Universal Phone Recognition with a Multilingual Allophone System. CoRR abs/2002.11800 (2020)
[i51]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-07692
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-07692
Anirudh Mani, Shruti Palaskar, Nimshi Venkat Meripo, Sandeep Konam, Florian Metze:
ASR Error Correction and Domain Adaptation Using Machine Translation. CoRR abs/2003.07692 (2020)
[i50]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-08031
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-08031
David R. Mortensen, Xinjian Li, Patrick Littell, Alexis Michaud, Shruti Rijhwani, Antonios Anastasopoulos, Alan W. Black, Florian Metze, Graham Neubig:
AlloVera: A Multilingual Allophone Database. CoRR abs/2004.08031 (2020)
[i49]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-08143
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-08143
Amanda Cardoso Duarte, Shruti Palaskar, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giró-i-Nieto:
How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language. CoRR abs/2008.08143 (2020)
[i48]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-05739
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-05739
Ze Cheng, Juncheng Li, Chenxu Wang, Jixuan Gu, Hao Xu, Xinjian Li, Florian Metze:
Revisiting Factorizing Aggregated Posterior in Learning Disentangled Representations. CoRR abs/2009.05739 (2020)
[i47]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-02384
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-02384
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott:
Fine-Grained Grounding for Multimodal Speech Recognition. CoRR abs/2010.02384 (2020)
[i46]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-02824
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-02824
Mandela Patrick, Po-Yao Huang, Yuki Markus Asano, Florian Metze, Alexander G. Hauptmann, João F. Henriques, Andrea Vedaldi:
Support-set bottlenecks for video-text representation learning. CoRR abs/2010.02824 (2020)
[i45]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-04924
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-04924
Vikas Raunak, Siddharth Dalmia, Vivek Gupta, Florian Metze:
On Long-Tailed Phenomena in Neural Machine Translation. CoRR abs/2010.04924 (2020)
[i44]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-08642
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-08642
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott:
Multimodal Speech Recognition with Unstructured Audio Masking. CoRR abs/2010.08642 (2020)
[i43]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-07430
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-07430
Juncheng B. Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze:
Audio-Visual Event Recognition through the lens of Adversary. CoRR abs/2011.07430 (2020)

2010 – 2019

2019
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/ijmir/MithunLMR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijmir/MithunLMR19
Niluthpol Chowdhury Mithun, Juncheng Li, Florian Metze, Amit K. Roy-Chowdhury:
Joint embeddings with multimodal cues for video-text retrieval. Int. J. Multim. Inf. Retr. 8(1): 3-18 (2019)
[j10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/RasanenSKRBCMCR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/RasanenSKRBCMCR19
Okko Räsänen, Shreyas Seshadri, Julien Karadayi, Eric Riebling, John P. Bunce, Alejandrina Cristià, Florian Metze, Marisa Casillas, Celia Rosemberg, Elika Bergelson, Melanie Soderstrom:
Automatic word count estimation from daylong child-centered recordings in various language environments using language-independent syllabification of speech. Speech Commun. 113: 63-80 (2019)
[c175]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/KimDM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/KimDM19
Suyoun Kim, Siddharth Dalmia, Florian Metze:
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion. ACL (1) 2019: 1131-1141
[c174]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/PalaskarLGM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/PalaskarLGM19
Shruti Palaskar, Jindrich Libovický, Spandana Gella, Florian Metze:
Multimodal Abstractive Summarization for How2 Videos. ACL (1) 2019: 6587-6596
[c173]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0005LM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0005LM19
Yun Wang, Juncheng Li, Florian Metze:
A Comparison of Five Multiple Instance Learning Pooling Functions for Sound Event Detection with Weak Labeling. ICASSP 2019: 31-35
[c172]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0005M19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0005M19
Yun Wang, Florian Metze:
Connectionist Temporal Localization for Sound Event Detection with Sequential Labeling. ICASSP 2019: 745-749
[c171]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DalmiaLBM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DalmiaLBM19
Siddharth Dalmia, Xinjian Li, Alan W. Black, Florian Metze:
Phoneme Level Language Models for Sequence Based Low Resource ASR. ICASSP 2019: 6091-6095
[c170]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PalaskarRM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PalaskarRM19
Shruti Palaskar, Vikas Raunak, Florian Metze:
Learned in Speech Recognition: Contextual Acoustic Word Embeddings. ICASSP 2019: 6530-6534
[c169]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HolzenbergerPMM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HolzenbergerPMM19
Nils Holzenberger, Shruti Palaskar, Pranava Madhyastha, Florian Metze, Raman Arora:
Learning from Multiview Correlations in Open-domain Videos. ICASSP 2019: 8628-8632
[c168]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/CaglayanSPBM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaglayanSPBM19
Ozan Caglayan, Ramon Sanabria, Shruti Palaskar, Loïc Barrault, Florian Metze:
Multimodal Grounding for Sequence-to-sequence Speech Recognition. ICASSP 2019: 8648-8652
[c167]
- view
  authority control:
- export record
  dblp key:
  - conf/inlg/RaunakCLXM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/inlg/RaunakCLXM19
Vikas Raunak, Sang Keun Choe, Quanyang Lu, Yi Xu, Florian Metze:
On Leveraging the Visual Modality for Neural Machine Translation. INLG 2019: 147-151
[c166]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiDBM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiDBM19
Xinjian Li, Siddharth Dalmia, Alan W. Black, Florian Metze:
Multilingual Speech Recognition with Corpus Relatedness Sampling. INTERSPEECH 2019: 2120-2124
[c165]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/LiZDBM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZDBM19
Xinjian Li, Zhong Zhou, Siddharth Dalmia, Alan W. Black, Florian Metze:
SANTLR: Speech Annotation Toolkit for Low Resource Languages. INTERSPEECH 2019: 3681-3682
[c164]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimDM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimDM19
Suyoun Kim, Siddharth Dalmia, Florian Metze:
Cross-Attention End-to-End ASR for Two-Party Conversations. INTERSPEECH 2019: 4380-4384
[c163]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/Metze19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Metze19
Florian Metze:
Survey Talk: Multimodal Processing of Speech and Language. INTERSPEECH 2019
[c162]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/SrinivasanSM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/SrinivasanSM19
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
CMU's Machine Translation System for IWSLT 2019. IWSLT 2019
[c161]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/SrinivasanSM19a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/SrinivasanSM19a
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Multitask Learning For Different Subword Segmentations In Neural Machine Translation. IWSLT 2019
[c160]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/MoriyaSMJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/MoriyaSMJ19
Yasufumi Moriya, Ramon Sanabria, Florian Metze, Gareth J. F. Jones:
MediaEval 2019: Eyes and Ears Together. MediaEval 2019
[c159]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/KimM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/KimM19
Suyoun Kim, Florian Metze:
Acoustic-to-Word Models with Conversational Context Information. NAACL-HLT (1) 2019: 2766-2771
[c158]
- view
- export record
  dblp key:
  - conf/nips/0001QLSKM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001QLSKM19
Juncheng Li, Shuhui Qu, Xinjian Li, Joseph Szurley, J. Zico Kolter, Florian Metze:
Adversarial Music: Real world Audio Adversary against Wake-word Detection System. NeurIPS 2019: 11908-11918
[c157]
- view
  authority control:
- export record
  dblp key:
  - conf/rep4nlp/RaunakGM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rep4nlp/RaunakGM19
Vikas Raunak, Vivek Gupta, Florian Metze:
Effective Dimensionality Reduction for Word Embeddings. RepL4NLP@ACL 2019: 235-243
[i42]
- view
  - electronic edition @ nist.gov (open access)
  - details & citations
- export record
  dblp key:
  - conf/tac/HovyCCGHMMSDCCK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tac/HovyCCGHMMSDCCK19
Eduard H. Hovy, Jaime G. Carbonell, Hans Chalupsky, Anatole Gershman, Alex Hauptmann, Florian Metze, Teruko Mitamura, Zaid Sheikh, Ankit Dangi, Aditi Chaudhary, Xianyang Chen, Xiang Kong, Bernie Huang, Salvador Medina, Hector Liu, Xuezhe Ma, Maria Ryskina, Ramon Sanabria, Varun Gangal:
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC 2019
[i41]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-06833
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-06833
Shruti Palaskar, Vikas Raunak, Florian Metze:
Learned In Speech Recognition: Contextual Acoustic Word Embeddings. CoRR abs/1902.06833 (2019)
[i40]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-07613
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-07613
Siddharth Dalmia, Xinjian Li, Alan W. Black, Florian Metze:
Phoneme Level Language Models for Sequence Based Low Resource ASR. CoRR abs/1902.07613 (2019)
[i39]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-08899
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-08899
Aditi Chaudhary, Siddharth Dalmia, Junjie Hu, Xinjian Li, Austin Matthews, Aldrian Obaja Muis, Naoki Otani, Shruti Rijhwani, Zaid Sheikh, Nidhi Vyas, Xinyi Wang, Jiateng Xie, Ruochen Xu, Chunting Zhou, Peter J. Jansen, Yiming Yang, Lori S. Levin, Florian Metze, Teruko Mitamura, David R. Mortensen, Graham Neubig, Eduard H. Hovy, Alan W. Black, Jaime G. Carbonell, Graham Horwood, Shabnam Tafreshi, Mona T. Diab, Efsun Sarioglu Kayi, Noura Farra, Kathleen R. McKeown:
The ARIEL-CMU Systems for LoReHLT18. CoRR abs/1902.08899 (2019)
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-08796
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-08796
Suyoun Kim, Florian Metze:
Acoustic-to-Word Models with Conversational Context Information. CoRR abs/1905.08796 (2019)
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-06147
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-06147
Yasufumi Moriya, Ramon Sanabria, Florian Metze, Gareth J. F. Jones:
Grounding Object Detections With Transcriptions. CoRR abs/1906.06147 (2019)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-07901
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-07901
Shruti Palaskar, Jindrich Libovický, Spandana Gella, Florian Metze:
Multimodal Abstractive Summarization for How2 Videos. CoRR abs/1906.07901 (2019)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-11604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-11604
Suyoun Kim, Siddharth Dalmia, Florian Metze:
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion. CoRR abs/1906.11604 (2019)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-00477
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-00477
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions. CoRR abs/1907.00477 (2019)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-10726
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-10726
Suyoun Kim, Siddharth Dalmia, Florian Metze:
Cross-Attention End-to-End ASR for Two-Party Conversations. CoRR abs/1907.10726 (2019)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-01060
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-01060
Xinjian Li, Siddharth Dalmia, Alan W. Black, Florian Metze:
Multilingual Speech Recognition with Corpus Relatedness Sampling. CoRR abs/1908.01060 (2019)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-01067
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-01067
Xinjian Li, Zhong Zhou, Siddharth Dalmia, Alan W. Black, Florian Metze:
SANTLR: Speech Annotation Toolkit for Low Resource Languages. CoRR abs/1908.01067 (2019)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-02211
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-02211
Vikas Raunak, Vaibhav Kumar, Vivek Gupta, Florian Metze:
On Dimensional Linguistic Properties of the Word Embedding Space. CoRR abs/1910.02211 (2019)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-02754
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-02754
Vikas Raunak, Sang Keun Choe, Quanyang Lu, Yi Xu, Florian Metze:
On Leveraging the Visual Modality for Neural Machine Translation. CoRR abs/1910.02754 (2019)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-12368
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-12368
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Multitask Learning For Different Subword Segmentations In Neural Machine Translation. CoRR abs/1910.12368 (2019)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-00126
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-00126
Juncheng B. Li, Shuhui Qu, Xinjian Li, J. Zico Kolter, Florian Metze:
Adversarial Music: Real World Audio Adversary Against Wake-word Detection System. CoRR abs/1911.00126 (2019)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-01497
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-01497
Vikas Raunak, Vaibhav Kumar, Florian Metze:
On Compositionality in Neural Machine Translation. CoRR abs/1911.01497 (2019)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-03782
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-03782
Siddharth Dalmia, Abdelrahman Mohamed, Mike Lewis, Florian Metze, Luke Zettlemoyer:
Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models. CoRR abs/1911.03782 (2019)
2018
[c156]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DalmiaSMB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DalmiaSMB18
Siddharth Dalmia, Ramon Sanabria, Florian Metze, Alan W. Black:
Sequence-Based Multi-Lingual Low Resource Speech Recognition. ICASSP 2018: 4909-4913
[c155]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ScharenborgBBHM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ScharenborgBBHM18
Odette Scharenborg, Laurent Besacier, Alan W. Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux:
Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop. ICASSP 2018: 4979-4983
[c154]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RyantBCCDGKKKKL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RyantBCCDGKKKKL18
Neville Ryant, Elika Bergelson, Kenneth Church, Alejandrina Cristià, Jun Du, Sriram Ganapathy, Sanjeev Khudanpur, Diana Kowalski, Mahesh Krishnamoorthy, Rajat Kulshreshta, Mark Y. Liberman, Yu-Ding Lu, Matthew Maciejewski, Florian Metze, Ján Profant, Lei Sun, Yu Tsao, Zhou Yu:
Enhancement and Analysis of Conversational Speech: JSALT 2017. ICASSP 2018: 5154-5158
[c153]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PalaskarSM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PalaskarSM18
Shruti Palaskar, Ramon Sanabria, Florian Metze:
End-to-end Multimodal Speech Recognition. ICASSP 2018: 5774-5778
[c152]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiWSMD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiWSMD18
Juncheng Li, Yun Wang, Joseph Szurley, Florian Metze, Samarjit Das:
A Light-Weight Multimodal Framework for Improved Environmental Audio Tagging. ICASSP 2018: 6832-6836
[c151]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZenkelSMW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZenkelSMW18
Thomas Zenkel, Ramon Sanabria, Florian Metze, Alex Waibel:
Subword and Crossword Units for CTC Acoustic Models. INTERSPEECH 2018: 396-400
[c150]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLM18
Yun Wang, Juncheng Li, Florian Metze:
Comparing the Max and Noisy-Or Pooling Functions in Multiple Instance Learning for Weakly Supervised Sequence Learning Tasks. INTERSPEECH 2018: 1339-1343
[c149]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FrancRKWSMC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FrancRKWSMC18
Adrien Le Franc, Eric Riebling, Julien Karadayi, Yun Wang, Camila Scaff, Florian Metze, Alejandrina Cristià:
The ACLEW DiViMe: An Easy-to-use Diarization Tool. INTERSPEECH 2018: 1383-1387
[c148]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TsengLWMSD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TsengLWMSD18
Shao-Yen Tseng, Juncheng Li, Yun Wang, Florian Metze, Joseph Szurley, Samarjit Das:
Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection. INTERSPEECH 2018: 3279-3283
[c147]
- view
  - electronic edition @ lrec-conf.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/lrec/LiCWM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lrec/LiCWM18
Boyang Li, Beth Cardier, Tong Wang, Florian Metze:
Annotating High-Level Structures of Short Stories and Personal Anecdotes. LREC 2018
[c146]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/MoriyaSMJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/MoriyaSMJ18
Yasufumi Moriya, Ramon Sanabria, Florian Metze, Gareth J. F. Jones:
Eyes and Ears Together: New Task for Multimodal Spoken Content Analysis. MediaEval 2018
[c145]
- view
  authority control:
- export record
  dblp key:
  - conf/mir/MithunLMR18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mir/MithunLMR18
Niluthpol Chowdhury Mithun, Juncheng Li, Florian Metze, Amit K. Roy-Chowdhury:
Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval. ICMR 2018: 19-27
[c144]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/DalmiaLMB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/DalmiaLMB18
Siddharth Dalmia, Xinjian Li, Florian Metze, Alan W. Black:
Domain Robust Feature Extraction for Rapid Low Resource ASR Development. SLT 2018: 258-265
[c143]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/PalaskarM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/PalaskarM18
Shruti Palaskar, Florian Metze:
Acoustic-to-Word Recognition with Sequence-to-Sequence Models. SLT 2018: 397-404
[c142]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/KimM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/KimM18
Suyoun Kim, Florian Metze:
Dialog-Context Aware end-to-end Speech Recognition. SLT 2018: 434-440
[c141]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SanabriaM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SanabriaM18
Ramon Sanabria, Florian Metze:
Hierarchical Multitask Learning With CTC. SLT 2018: 485-490
[i24]
- view
  - electronic edition @ nist.gov (open access)
  - details & citations
- export record
  dblp key:
  - conf/tac/HovyBCCGHMMCCHL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tac/HovyBCCGHMMCCHL18
Eduard H. Hovy, Taylor Berg-Kirkpatrick, Jaime G. Carbonell, Hans Chalupsky, Anatole Gershman, Alexander G. Hauptmann, Florian Metze, Teruko Mitamura, Aditi Chaudhary, Xianyang Chen, Bernie Po-Yao Huang, Hector Zhengzhong Liu, Xuezhe Ma, Shruti Palaskar, Dheeraj Rajagopal, Maria Ryskina, Ramon Sanabria:
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC 2018
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-05092
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-05092
Odette Scharenborg, Laurent Besacier, Alan W. Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux:
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop. CoRR abs/1802.05092 (2018)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-07420
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-07420
Siddharth Dalmia, Ramon Sanabria, Florian Metze, Alan W. Black:
Sequence-based Multi-lingual Low Resource Speech Recognition. CoRR abs/1802.07420 (2018)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-01146
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-01146
Yun Wang, Juncheng Li, Florian Metze:
Comparing the Max and Noisy-Or Pooling Functions in Multiple Instance Learning for Weakly Supervised Sequence Learning Tasks. CoRR abs/1804.01146 (2018)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-09713
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-09713
Shruti Palaskar, Ramon Sanabria, Florian Metze:
End-to-End Multimodal Speech Recognition. CoRR abs/1804.09713 (2018)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-07104
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-07104
Ramon Sanabria, Florian Metze:
Hierarchical Multi Task Learning With CTC. CoRR abs/1807.07104 (2018)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-09597
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-09597
Shruti Palaskar, Florian Metze:
Acoustic-to-Word Recognition with Sequence-to-Sequence Models. CoRR abs/1807.09597 (2018)
[i17]
Siddharth Dalmia, Xinjian Li, Florian Metze, Alan W. Black:
Domain Robust Feature Extraction for Rapid Low Resource ASR Development. CoRR abs/1807.10984 (2018)
[i16]
Suyoun Kim, Florian Metze:
Dialog-context aware end-to-end speech recognition. CoRR abs/1808.02171 (2018)
[i15]
Ankit Shah, Harini Kesavamoorthy, Poorva Rane, Pramati Kalwad, Alexander G. Hauptmann, Florian Metze:
Activity Recognition on a Large Scale in Short Videos - Moments in Time Dataset. CoRR abs/1809.00241 (2018)
[i14]
Yun Wang, Juncheng Li, Florian Metze:
A Comparison of Five Multiple Instance Learning Pooling Functions for Sound Event Detection with Weak Labeling. CoRR abs/1810.09050 (2018)
[i13]
Yun Wang, Florian Metze:
Connectionist Temporal Localization for Sound Event Detection with Sequential Labeling. CoRR abs/1810.09052 (2018)
[i12]
Ramon Sanabria, Ozan Caglayan, Shruti Palaskar, Desmond Elliott, Loïc Barrault, Lucia Specia, Florian Metze:
How2: A Large-scale Dataset for Multimodal Language Understanding. CoRR abs/1811.00347 (2018)
[i11]
Ozan Caglayan, Ramon Sanabria, Shruti Palaskar, Loïc Barrault, Florian Metze:
Multimodal Grounding for Sequence-to-Sequence Speech Recognition. CoRR abs/1811.03865 (2018)
[i10]
Nils Holzenberger, Shruti Palaskar, Pranava Madhyastha, Florian Metze, Raman Arora:
Learning from Multiview Correlations in Open-Domain Videos. CoRR abs/1811.08890 (2018)
2017
[c140]
Juncheng Li, Wei Dai, Florian Metze, Shuhui Qu, Samarjit Das:
A comparison of Deep Learning methods for environmental sound detection. ICASSP 2017: 126-130
[c139]
Yun Wang, Florian Metze:
A first attempt at polyphonic sound event detection using connectionist temporal classification. ICASSP 2017: 2986-2990
[c138]
Abhinav Gupta, Yajie Miao, Leonardo Neves, Florian Metze:
Visual features for context-aware speech recognition. ICASSP 2017: 5020-5024
[c137]
Thomas Zenkel, Ramon Sanabria, Florian Metze, Jan Niehues, Matthias Sperber, Sebastian Stüker, Alex Waibel:
Comparison of Decoding Strategies for CTC Acoustic Models. INTERSPEECH 2017: 513-517
[c136]
Yun Wang, Florian Metze:
A Transfer Learning Based Feature Extractor for Polyphonic Sound Event Detection Using Connectionist Temporal Classification. INTERSPEECH 2017: 3097-3101
[c135]
Niluthpol Chowdhury Mithun, Juncheng B. Li, Florian Metze, Amit K. Roy-Chowdhury, Samarjit Das:
CMU-UCR-BOSCH @ TRECVID 2017: VIDEO TO TEXT RETRIEVAL. TRECVID 2017
[p3]
Shinji Watanabe, Marc Delcroix, Florian Metze, John R. Hershey:
Preliminaries. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 3-17
[p2]
Yajie Miao, Florian Metze:
End-to-End Architectures for Speech Recognition. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 299-323
[p1]
Shinji Watanabe, Takaaki Hori, Yajie Miao, Marc Delcroix, Florian Metze, John R. Hershey:
Toolkits for Robust Speech Processing. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 369-382
[e4]
Shinji Watanabe, Marc Delcroix, Florian Metze, John R. Hershey:
New Era for Robust Speech Recognition, Exploiting Deep Learning. Springer 2017, ISBN 978-3-319-64679-4
[i9]
Juncheng Li, Wei Dai, Florian Metze, Shuhui Qu, Samarjit Das:
A Comparison of deep learning methods for environmental sound. CoRR abs/1703.06902 (2017)
[i8]
Thomas Zenkel, Ramon Sanabria, Florian Metze, Jan Niehues, Matthias Sperber, Sebastian Stüker, Alex Waibel:
Comparison of Decoding Strategies for CTC Acoustic Models. CoRR abs/1708.04469 (2017)
[i7]
Boyang Li, Beth Cardier, Tong Wang, Florian Metze:
Annotating High-Level Structures of Short Stories and Personal Anecdotes. CoRR abs/1710.06917 (2017)
[i6]
Abhinav Gupta, Yajie Miao, Leonardo Neves, Florian Metze:
Visual Features for Context-Aware Speech Recognition. CoRR abs/1712.00489 (2017)
[i5]
Thomas Zenkel, Ramon Sanabria, Florian Metze, Alex Waibel:
Subword and Crossword Units for CTC Acoustic Models. CoRR abs/1712.06855 (2017)
[i4]
Shao-Yen Tseng, Juncheng Li, Yun Wang, Joseph Szurley, Florian Metze, Samarjit Das:
Multiple Instance Deep Learning for Weakly Supervised Audio Event Detection. CoRR abs/1712.09673 (2017)
[i3]
Juncheng Li, Yun Wang, Joseph Szurley, Florian Metze, Samarjit Das:
A Light-Weight Multimodal Framework for Improved Environmental Audio Tagging. CoRR abs/1712.09680 (2017)
2016
[c134]
Marvin Ritter, Markus Müller, Sebastian Stüker, Florian Metze, Alex Waibel:
Training Deep Neural Networks for Reverberation Robust Speech Recognition. ITG Symposium on Speech Communication 2016: 1-5
[c133]
Yajie Miao, Mohammad Gowayyed, Xingyu Na, Tom Ko, Florian Metze, Alexander Waibel:
An empirical exploration of CTC acoustic models. ICASSP 2016: 2623-2627
[c132]
Yun Wang, Leonardo Neves, Florian Metze:
Audio-based multimedia event detection using deep recurrent neural networks. ICASSP 2016: 2742-2746
[c131]
Florian Metze, Eric Riebling, Anne S. Warlaumont, Elika Bergelson:
Virtual Machines and Containers as a Platform for Experimentation. INTERSPEECH 2016: 1603-1607
[c130]
Rebecca Bates, Eric Fosler-Lussier, Florian Metze, Martha A. Larson, Gina-Anne Levow, Emily Mower Provost:
Experiences with Shared Resources for Research and Education in Speech and Language Processing. INTERSPEECH 2016: 1627-1631
[c129]
Yashesh Gaur, Florian Metze, Jeffrey P. Bigham:
Manipulating Word Lattices to Incorporate Human Corrections. INTERSPEECH 2016: 3062-3065
[c128]
Yajie Miao, Florian Metze:
Open-Domain Audio-Visual Speech Recognition: A Deep Learning Approach. INTERSPEECH 2016: 3414-3418
[c127]
Yun Wang, Florian Metze:
Recurrent Support Vector Machines for Audio-Based Multimedia Event Detection. ICMR 2016: 265-269
[c126]
Yashesh Gaur, Walter S. Lasecki, Florian Metze, Jeffrey P. Bigham:
The effects of automatic speech recognition quality on human transcription latency. W4A 2016: 23:1-23:8
[i2]
Ramon Sanabria, Florian Metze, Fernando De la Torre:
Robust end-to-end deep audiovisual speech recognition. CoRR abs/1611.06986 (2016)
2015
[j9]
Yajie Miao, Hao Zhang, Florian Metze:
Speaker Adaptive Training of Deep Neural Network Acoustic Models Using I-Vectors. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 1938-1949 (2015)
[c125]
Yajie Miao, Mohammad Gowayyed, Florian Metze:
EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding. ASRU 2015: 167-174
[c124]
Florian Metze, Ankur Gandhe, Yajie Miao, Zaid Sheikh, Yun Wang, Di Xu, Hao Zhang, Jungsuk Kim, Ian R. Lane, Wonkyum Lee, Sebastian Stüker, Markus Müller:
Semi-supervised training in low-resource ASR and KWS. ICASSP 2015: 4699-4703
[c123]
Hao Zhang, Yajie Miao, Florian Metze:
Regularizing DNN acoustic models with Gaussian stochastic neurons. ICASSP 2015: 4964-4968
[c122]
Xavier Anguera, Luis Javier Rodríguez-Fuentes, Andi Buzo, Florian Metze, Igor Szöke, Mikel Peñagarikano:
QUESST2014: Evaluating Query-by-Example Speech Search in a zero-resource setting with real-life queries. ICASSP 2015: 5833-5837
[c121]
Yajie Miao, Florian Metze:
Distance-aware DNNs for robust speech recognition. INTERSPEECH 2015: 761-765
[c120]
Yajie Miao, Florian Metze:
On speaker adaptation of long short-term memory recurrent neural networks. INTERSPEECH 2015: 1101-1105
[c119]
Florian Metze, Eric Riebling, Eric Fosler-Lussier, Andrew R. Plummer, Rebecca Bates:
The speech recognition virtual kitchen turns one. INTERSPEECH 2015: 2617-2618
[c118]
Yashesh Gaur, Florian Metze, Yajie Miao, Jeffrey P. Bigham:
Using keyword spotting to help humans correct captioning faster. INTERSPEECH 2015: 2829-2833
[c117]
Igor Szöke, Luis Javier Rodríguez-Fuentes, Andi Buzo, Xavier Anguera, Florian Metze, Jorge Proença, Martin Lojka, Xiao Xiong:
Query by Example Search on Speech at Mediaeval 2015. MediaEval 2015
[c116]
Shoou-I Yu, Lu Jiang, Zhongwen Xu, Zhenzhong Lan, Shicheng Xu, Xiaojun Chang, Xuanchong Li, Zexi Mao, Chuang Gan, Yajie Miao, Xingzhong Du, Yang Cai, Lara J. Martin, Nikolas Wolfe, Anurag Kumar, Huan Li, Ming Lin, Zhigang Ma, Yi Yang, Deyu Meng, Shiguang Shan, Pinar Duygulu Sahin, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Teruko Mitamura, Richard M. Stern, Alexander G. Hauptmann:
CMU Informedia@TRECVID 2015: MED/SIN/LNK/SED. TRECVID 2015
[i1]
Yajie Miao, Mohammad Gowayyed, Florian Metze:
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding. CoRR abs/1507.08240 (2015)
2014
[j8]
Anuj Kumar, Florian Metze, Matthew Kam:
Enabling the Rapid Development and Adoption of Speech-User Interfaces. Computer 47(1): 40-47 (2014)
[j7]
Florian Metze, Xavier Anguera, Etienne Barnard, Marelie H. Davel, Guillaume Gravier:
Language independent search in MediaEval's Spoken Web Search task. Comput. Speech Lang. 28(5): 1066-1082 (2014)
[c115]
Florian Metze, Koichi Shinoda:
Semantics for Large-Scale Multimedia: New Challenges for NLP. ACL (Tutorial Abstracts) 2014: 6
[c114]
Yulia Tsvetkov, Florian Metze, Chris Dyer:
Augmenting Translation Models with Simulated Acoustic Confusions for Improved Spoken Language Translation. EACL 2014: 616-625
[c113]
Yipei Wang, Shourabh Rawat, Florian Metze:
Exploring audio semantic concepts for event-based video retrieval. ICASSP 2014: 1360-1364
[c112]
Yipei Wang, Shourabh Rawat, Florian Metze:
Semi-automatic audio semantic concept discovery for multimedia retrieval. ICASSP 2014: 1375-1379
[c111]
Ankur Gandhe, Florian Metze, Alex Waibel, Ian R. Lane:
Optimization of Neural Network Language Models for keyword search. ICASSP 2014: 4888-4892
[c110]
Florian Metze, Shourabh Rawat, Yipei Wang:
Improved audio features for large-scale multimedia event detection. ICME 2014: 1-6
[c109]
Yajie Miao, Florian Metze:
Improving language-universal feature extraction with deep maxout and convolutional neural networks. INTERSPEECH 2014: 800-804
[c108]
Yajie Miao, Hao Zhang, Florian Metze:
Distributed learning of multilingual DNN feature extractors using GPUs. INTERSPEECH 2014: 830-834
[c107]
Andrew R. Plummer, Eric Riebling, Anuj Kumar, Florian Metze, Eric Fosler-Lussier, Rebecca Bates:
The speech recognition virtual kitchen: launch party. INTERSPEECH 2014: 2140-2141
[c106]
Yajie Miao, Hao Zhang, Florian Metze:
Towards speaker adaptive training of deep neural network acoustic models. INTERSPEECH 2014: 2189-2193
[c105]
Xavier Anguera, Luis Javier Rodríguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, Mikel Peñagarikano:
Query-by-example spoken term detection on multilingual unconstrained speech. INTERSPEECH 2014: 2459-2463
[c104]
Yun Wang, Florian Metze:
An in-depth comparison of keyword specific thresholding and sum-to-one score normalization. INTERSPEECH 2014: 2474-2478
[c103]
Ankur Gandhe, Florian Metze, Ian R. Lane:
Neural network language models for low resource languages. INTERSPEECH 2014: 2615-2619
[c102]
Di Xu, Florian Metze:
Word-based probabilistic phonetic retrieval for low-resource spoken term detection. INTERSPEECH 2014: 2774-2778
[c101]
Markus Müller, Sebastian Stüker, Zaid Sheikh, Florian Metze, Alex Waibel:
Multilingual deep bottle neck features: a study on language selection and training techniques. IWSLT 2014
[c100]
Xavier Anguera, Luis Javier Rodríguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze:
Query by Example Search on Speech at Mediaeval 2014. MediaEval 2014
[c99]
Lara J. Martin, Matthew Stone, Florian Metze, Jack Mostow:
A methodology for using crowdsourced data to measure uncertainty in natural speech. SLT 2014: 95-99
[c98]
Yajie Miao, Lu Jiang, Hao Zhang, Florian Metze:
Improvements to speaker adaptive training of deep neural networks. SLT 2014: 165-170
[c97]
Di Xu, Yun Wang, Florian Metze:
EM-based phoneme confusion matrix generation for low-resource spoken term detection. SLT 2014: 424-429
[c96]
Jan Trmal, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur, Pegah Ghahremani, Xiaohui Zhang, Vimal Manohar, Chunxi Liu, Aren Jansen, Dietrich Klakow, David Yarowsky, Florian Metze:
A keyword search system using open source software. SLT 2014: 530-535
[c95]
Xavier Anguera, Luis Javier Rodríguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, Mikel Peñagarikano:
Query-by-example spoken term detection evaluation on low-resource languages. SLTU 2014: 24-31
[c94]
Shoou-I Yu, Lu Jiang, Zhongwen Xu, Zhenzhong Lan, Shicheng Xu, Xiaojun Chang, Xuanchong Li, Zexi Mao, Chuang Gan, Yajie Miao, Xingzhong Du, Yang Cai, Lara J. Martin, Nikolas Wolfe, Anurag Kumar, Huan Li, Ming Lin, Zhigang Ma, Yi Yang, Deyu Meng, Shiguang Shan, Pinar Duygulu Sahin, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Teruko Mitamura, Richard M. Stern, Alexander G. Hauptmann, Anil Armagan, Yicheng Zhao:
Informedia @ TRECVID 2014. TRECVID 2014
2013
[j6]
Florian Metze, Duo Ding, Ehsan Younessian, Alexander G. Hauptmann:
Beyond audio and video retrieval: topic-oriented multimedia summarization. Int. J. Multim. Inf. Retr. 2(2): 131-144 (2013)
[c93]
Udhyakumar Nallasamy, Mark C. Fuhs, Monika Woszczyna, Florian Metze, Tanja Schultz:
Neighbour selection and adaptation for rapid speaker-dependent ASR. ASRU 2013: 60-65
[c92]
Florian Metze, Zaid Sheikh, Alex Waibel, Jonas Gehring, Kevin Kilgour, Quoc Bao Nguyen, Van Huy Nguyen:
Models of tone for tonal and non-tonal languages. ASRU 2013: 261-266
[c91]
Jonas Gehring, Quoc Bao Nguyen, Florian Metze, Alex Waibel:
DNN acoustic modeling with modular multi-lingual feature extraction networks. ASRU 2013: 344-349
[c90]
Yajie Miao, Florian Metze, Shourabh Rawat:
Deep maxout networks for low-resource speech recognition. ASRU 2013: 398-403
[c89]
Ankur Gandhe, Long Qin, Florian Metze, Alexander I. Rudnicky, Ian R. Lane, Matthias Eck:
Using web text to improve keyword spotting in speech. ASRU 2013: 428-433
[c88]
Jonas Gehring, Yajie Miao, Florian Metze, Alex Waibel:
Extracting deep bottleneck features using stacked auto-encoders. ICASSP 2013: 3377-3381
[c87]
- dblp key: conf/icassp/MiaoMW13
- persistent URL: https://dblp.org/rec/conf/icassp/MiaoMW13
Yajie Miao, Florian Metze, Alex Waibel:
Subspace mixture model for low-resource speech recognition in cross-lingual settings. ICASSP 2013: 7339-7343
[c86]
- dblp key: conf/icassp/TsvetkovSM13
- persistent URL: https://dblp.org/rec/conf/icassp/TsvetkovSM13
Yulia Tsvetkov, Zaid Sheikh, Florian Metze:
Identification and modeling of word fragments in spontaneous speech. ICASSP 2013: 7624-7628
[c85]
- dblp key: conf/icassp/MiaoMW13a
- persistent URL: https://dblp.org/rec/conf/icassp/MiaoMW13a
Yajie Miao, Florian Metze, Alex Waibel:
Learning discriminative basis coefficients for eigenspace MLLR unsupervised adaptation. ICASSP 2013: 7927-7931
[c84]
- dblp key: conf/icassp/JansenDGJKCFHMRSCMVBBCDFHLLNPRST13
- persistent URL: https://dblp.org/rec/conf/icassp/JansenDGJKCFHMRSCMVBBCDFHLLNPRST13
Aren Jansen, Emmanuel Dupoux, Sharon Goldwater, Mark Johnson, Sanjeev Khudanpur, Kenneth Church, Naomi Feldman, Hynek Hermansky, Florian Metze, Richard C. Rose, Mike Seltzer, Pascal Clark, Ian McGraw, Balakrishnan Varadarajan, Erin Bennett, Benjamin Börschinger, Justin T. Chiu, Ewan Dunbar, Abdellah Fourtassi, David Harwath, Chia-ying Lee, Keith D. Levin, Atta Norouzian, Vijayaditya Peddinti, Rachael Richardson, Thomas Schatz, Samuel Thomas:
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition. ICASSP 2013: 8111-8115
[c83]
- dblp key: conf/icassp/MetzeABDG13
- persistent URL: https://dblp.org/rec/conf/icassp/MetzeABDG13
Florian Metze, Xavier Anguera, Etienne Barnard, Marelie H. Davel, Guillaume Gravier:
The spoken web search task at MediaEval 2012. ICASSP 2013: 8121-8125
[c82]
- dblp key: conf/ijcnlp/JauharCM13
- persistent URL: https://dblp.org/rec/conf/ijcnlp/JauharCM13
Sujay Kumar Jauhar, Yun-Nung Chen, Florian Metze:
Prosody-Based Unsupervised Speech Summarization with Two-Layer Mutually Reinforced Random Walk. IJCNLP 2013: 648-654
[c81]
- dblp key: conf/interspeech/ChenM13
- persistent URL: https://dblp.org/rec/conf/interspeech/ChenM13
Yun-Nung Chen, Florian Metze:
Multi-layer mutually reinforced random walk with hidden parameters for improved multi-party meeting summarization. INTERSPEECH 2013: 485-489
[c80]
- dblp key: conf/interspeech/KumarMWK13
- persistent URL: https://dblp.org/rec/conf/interspeech/KumarMWK13
Anuj Kumar, Florian Metze, Wenyi Wang, Matthew Kam:
Formalizing expert knowledge for developing accurate speech recognizers. INTERSPEECH 2013: 1121-1125
[c79]
- dblp key: conf/interspeech/MetzeFB13
- persistent URL: https://dblp.org/rec/conf/interspeech/MetzeFB13
Florian Metze, Eric Fosler-Lussier, Rebecca Bates:
The speech recognition virtual kitchen. INTERSPEECH 2013: 1858-1860
[c78]
- dblp key: conf/interspeech/MiaoM13
- persistent URL: https://dblp.org/rec/conf/interspeech/MiaoM13
Yajie Miao, Florian Metze:
Improving low-resource CD-DNN-HMM using dropout and multilingual DNN training. INTERSPEECH 2013: 2237-2241
[c77]
- dblp key: conf/interspeech/RawatSBDWM13
- persistent URL: https://dblp.org/rec/conf/interspeech/RawatSBDWM13
Shourabh Rawat, Peter F. Schulam, Susanne Burger, Duo Ding, Yipei Wang, Florian Metze:
Robust audio-codebooks for large-scale event detection in consumer videos. INTERSPEECH 2013: 2929-2933
[c76]
- dblp key: conf/mediaeval/AngueraMBSR13
- persistent URL: https://dblp.org/rec/conf/mediaeval/AngueraMBSR13
Xavier Anguera, Florian Metze, Andi Buzo, Igor Szöke, Luis Javier Rodríguez-Fuentes:
The Spoken Web Search Task. MediaEval 2013
[c75]
- dblp key: conf/trecvid/Lan0YGR0XSLWS0M13
- persistent URL: https://dblp.org/rec/conf/trecvid/Lan0YGR0XSLWS0M13
Zhenzhong Lan, Lu Jiang, Shoou-I Yu, Chenqiang Gao, Shourabh Rawat, Yang Cai, Shicheng Xu, Haoquan Shen, Xuanchong Li, Yipei Wang, Waito Sze, Yan Yan, Zhigang Ma, Nicolas Ballas, Deyu Meng, Wei Tong, Yi Yang, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Richard M. Stern, Teruko Mitamura, Eric Nyberg, Alexander G. Hauptmann:
Informedia@TRECVID 2013. TRECVID 2013
2012
[j5]
- dblp key: journals/spm/LivescuFM12
- persistent URL: https://dblp.org/rec/journals/spm/LivescuFM12
Karen Livescu, Eric Fosler-Lussier, Florian Metze:
Subword Modeling for Automatic Speech Recognition: Past, Present, and Emerging Approaches. IEEE Signal Process. Mag. 29(6): 44-57 (2012)
[c74]
- dblp key: conf/icassp/BlackBDMMPPPSV12
- persistent URL: https://dblp.org/rec/conf/icassp/BlackBDMMPPPSV12
Alan W. Black, H. Timothy Bunnell, Ying Dou, Prasanna Kumar Muthukumar, Florian Metze, Daniel Perry, Tim Polzehl, Kishore Prahallad, Stefan Steidl, Callie Vaughn:
Articulatory features for expressive speech synthesis. ICASSP 2012: 4005-4008
[c73]
- dblp key: conf/icassp/MetzeRADGHMMPST12
- persistent URL: https://dblp.org/rec/conf/icassp/MetzeRADGHMMPST12
Florian Metze, Nitendra Rajput, Xavier Anguera, Marelie H. Davel, Guillaume Gravier, Charl Johannes van Heerden, Gautam Varma Mantena, Armando Muscariello, Kishore Prahallad, Igor Szöke, Javier Tejedor:
The Spoken Web Search Task at MediaEval 2011. ICASSP 2012: 5165-5168
[c72]
- dblp key: conf/inlg/DingMRSB12
- persistent URL: https://dblp.org/rec/conf/inlg/DingMRSB12
Duo Ding, Florian Metze, Shourabh Rawat, Peter Franz Schulam, Susanne Burger:
Generating Natural Language Summaries for Multimedia. INLG 2012: 128-130
[c71]
- dblp key: conf/interspeech/PolzehlSMMMV12
- persistent URL: https://dblp.org/rec/conf/interspeech/PolzehlSMMMV12
Tim Polzehl, Katrin Schoenenberg, Sebastian Möller, Florian Metze, Gelareh Mohammadi, Alessandro Vinciarelli:
On Speaker-Independent Personality Perception and Prediction from Speech. INTERSPEECH 2012: 258-261
[c70]
- dblp key: conf/interspeech/MetzeF12
- persistent URL: https://dblp.org/rec/conf/interspeech/MetzeF12
Florian Metze, Eric Fosler-Lussier:
The Speech Recognition Virtual Kitchen: An Initial Prototype. INTERSPEECH 2012: 1872-1873
[c69]
- dblp key: conf/interspeech/NallasamyMS12
- persistent URL: https://dblp.org/rec/conf/interspeech/NallasamyMS12
Udhyakumar Nallasamy, Florian Metze, Tanja Schultz:
Enhanced Polyphone Decision Tree Adaptation for Accented Speech Recognition. INTERSPEECH 2012: 1902-1905
[c68]
- dblp key: conf/interspeech/JinSRBDM12
- persistent URL: https://dblp.org/rec/conf/interspeech/JinSRBDM12
Qin Jin, Peter Franz Schulam, Shourabh Rawat, Susanne Burger, Duo Ding, Florian Metze:
Event-based Video Retrieval Using Audio. INTERSPEECH 2012: 2085-2088
[c67]
- dblp key: conf/interspeech/ChenM12
- persistent URL: https://dblp.org/rec/conf/interspeech/ChenM12
Yun-Nung Chen, Florian Metze:
Integrating Intra-Speaker Topic Modeling and Temporal-Based Inter-Speaker Topic Modeling in Random Walk for Improved Multi-Party Meeting Summarization. INTERSPEECH 2012: 2346-2349
[c66]
- dblp key: conf/interspeech/VuBMS12
- persistent URL: https://dblp.org/rec/conf/interspeech/VuBMS12
Ngoc Thang Vu, Wojtek Breiter, Florian Metze, Tanja Schultz:
Initialization Schemes for Multilayer Perceptron Training and their Impact on ASR Performance using Multilingual Data. INTERSPEECH 2012: 2586-2589
[c65]
- dblp key: conf/mediaeval/MetzeBDHAGR12
- persistent URL: https://dblp.org/rec/conf/mediaeval/MetzeBDHAGR12
Florian Metze, Etienne Barnard, Marelie H. Davel, Charl Johannes van Heerden, Xavier Anguera, Guillaume Gravier, Nitendra Rajput:
The Spoken Web Search Task. MediaEval 2012
[c64]
- dblp key: conf/mir/DingMRSBYBCH12
- persistent URL: https://dblp.org/rec/conf/mir/DingMRSBYBCH12
Duo Ding, Florian Metze, Shourabh Rawat, Peter Franz Schulam, Susanne Burger, Ehsan Younessian, Lei Bao, Michael G. Christel, Alexander G. Hauptmann:
Beyond audio and video retrieval: towards multimedia summarization. ICMR 2012: 2
[c63]
- dblp key: conf/mlslp/NallasamyMS12
- persistent URL: https://dblp.org/rec/conf/mlslp/NallasamyMS12
Udhyakumar Nallasamy, Florian Metze, Tanja Schultz:
Semi-supervised learning for speech recognition in the context of accent adaptation. MLSLP 2012: 13-17
[c62]
- dblp key: conf/mm/FriedlandEM12
- persistent URL: https://dblp.org/rec/conf/mm/FriedlandEM12
Gerald Friedland, Daniel P. W. Ellis, Florian Metze:
AMVA'12: ACM international workshop on audio and multimedia methods for large-scale video analysis. ACM Multimedia 2012: 1513-1514
[c61]
- dblp key: conf/naacl/ChenM12
- persistent URL: https://dblp.org/rec/conf/naacl/ChenM12
Yun-Nung Chen, Florian Metze:
Intra-Speaker Topic Modeling for Improved Multi-Party Meeting Summarization with Integrated Random Walk. HLT-NAACL 2012: 377-381
[c60]
- dblp key: conf/slt/NallasamyMS12
- persistent URL: https://dblp.org/rec/conf/slt/NallasamyMS12
Udhyakumar Nallasamy, Florian Metze, Tanja Schultz:
Active learning for accent adaptation in Automatic Speech Recognition. SLT 2012: 360-365
[c59]
- dblp key: conf/slt/ChenM12
- persistent URL: https://dblp.org/rec/conf/slt/ChenM12
Yun-Nung Chen, Florian Metze:
Two-layer mutually reinforced random walk for improved multi-party meeting summarization. SLT 2012: 461-466
[c58]
- dblp key: conf/sltu/WeinerVTMSLCL12
- persistent URL: https://dblp.org/rec/conf/sltu/WeinerVTMSLCL12
Jochen Weiner, Ngoc Thang Vu, Dominic Telaar, Florian Metze, Tanja Schultz, Dau-Cheng Lyu, Engsiong Chng, Haizhou Li:
Integration of language identification into a recognition system for spoken conversations containing code-Switches. SLTU 2012: 76-79
[c57]
- dblp key: conf/sltu/VuMS12
- persistent URL: https://dblp.org/rec/conf/sltu/VuMS12
Ngoc Thang Vu, Florian Metze, Tanja Schultz:
Multilingual bottle-neck features and its application for under-resourced languages. SLTU 2012: 90-93
[c56]
- dblp key: conf/trecvid/YuXDSVLCRSMBJT012
- persistent URL: https://dblp.org/rec/conf/trecvid/YuXDSVLCRSMBJT012
Shoou-I Yu, Zhongwen Xu, Duo Ding, Waito Sze, Francisco Vicente, Zhenzhong Lan, Yang Cai, Shourabh Rawat, Peter F. Schulam, Nisarga Markandaiah, Sohail Bahmani, Antonio Juárez, Wei Tong, Yi Yang, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Richard M. Stern, Teruko Mitamura, Eric Nyberg, Lu Jiang, Qiang Chen, Lisa M. Brown, Ankur Datta, Quanfu Fan, Rogério Schmidt Feris, Shuicheng Yan, Alexander G. Hauptmann, Sharath Pankanti:
Informedia @TRECVID 2012. TRECVID 2012
[e3]
- dblp key: conf/mediaeval/2012
- persistent URL: https://dblp.org/rec/conf/mediaeval/2012
Martha A. Larson, Sebastian Schmiedeke, Pascal Kelm, Adam Rae, Vasileios Mezaris, Tomas Piatrik, Mohammad Soleymani, Florian Metze, Gareth J. F. Jones:
Working Notes Proceedings of the MediaEval 2012 Workshop, Santa Croce in Fossabanda, Pisa, Italy, October 4-5, 2012. CEUR Workshop Proceedings 927, CEUR-WS.org 2012
2011
[j4]
- dblp key: journals/speech/PolzehlSMW11
- persistent URL: https://dblp.org/rec/journals/speech/PolzehlSMW11
Tim Polzehl, Alexander Schmitt, Florian Metze, Michael Wagner:
Anger recognition in speech using acoustic and linguistic cues. Speech Commun. 53(9-10): 1198-1209 (2011)
[c55]
- dblp key: books/daglib/p/PolzehlSM11
- persistent URL: https://dblp.org/rec/books/daglib/p/PolzehlSM11
Tim Polzehl, Alexander Schmitt, Florian Metze:
Salient Features for Anger Recognition in German and English IVR Portals. IWSDS 2011: 83-105
[c54]
- dblp key: conf/hci/MetzeBP11
- persistent URL: https://dblp.org/rec/conf/hci/MetzeBP11
Florian Metze, Alan W. Black, Tim Polzehl:
A Review of Personality in Voice-Based Man Machine Interaction. HCI (2) 2011: 358-367
[c53]
- dblp key: conf/interspeech/NallasamyGMJSS11
- persistent URL: https://dblp.org/rec/conf/interspeech/NallasamyGMJSS11
Udhyakumar Nallasamy, Michael Garbus, Florian Metze, Qin Jin, Thomas Schaaf, Tanja Schultz:
Analysis of Dialectal Influence in Pan-Arabic ASR. INTERSPEECH 2011: 1721-1724
[c52]
- dblp key: conf/interspeech/PolzehlMM11
- persistent URL: https://dblp.org/rec/conf/interspeech/PolzehlMM11
Tim Polzehl, Sebastian Möller, Florian Metze:
Modeling Speaker Personality Using Voice. INTERSPEECH 2011: 2369-2372
[c51]
- dblp key: conf/mediaeval/RajputM11
- persistent URL: https://dblp.org/rec/conf/mediaeval/RajputM11
Nitendra Rajput, Florian Metze:
Spoken Web Search. MediaEval 2011
[c50]
- dblp key: conf/trecvid/BaoZYL0OJTLLGBM11
- persistent URL: https://dblp.org/rec/conf/trecvid/BaoZYL0OJTLLGBM11
Lei Bao, Longfei Zhang, Shoou-I Yu, Zhen-zhong Lan, Lu Jiang, Arnold Overwijk, Qin Jin, Shohei Takahashi, Brian Langner, Yuanpeng Li, Michael Garbus, Susanne Burger, Florian Metze, Alexander G. Hauptmann:
Informedia@TRECVID 2011: Surveillance Event Detection. TRECVID 2011
[e2]
- dblp key: conf/mediaeval/2011
- persistent URL: https://dblp.org/rec/conf/mediaeval/2011
Martha A. Larson, Adam Rae, Claire-Hélène Demarty, Christoph Kofler, Florian Metze, Raphaël Troncy, Vasileios Mezaris, Gareth J. F. Jones:
Working Notes Proceedings of the MediaEval 2011 Workshop, Santa Croce in Fossabanda, Pisa, Italy, September 1-2, 2011. CEUR Workshop Proceedings 807, CEUR-WS.org 2011
2010
[c49]
- dblp key: conf/icassp/SchullerMSBEP10
- persistent URL: https://dblp.org/rec/conf/icassp/SchullerMSBEP10
Björn W. Schuller, Florian Metze, Stefan Steidl, Anton Batliner, Florian Eyben, Tim Polzehl:
Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote. ICASSP 2010: 5230-5233
[c48]
- dblp key: conf/interspeech/SchaafM10
- persistent URL: https://dblp.org/rec/conf/interspeech/SchaafM10
Thomas Schaaf, Florian Metze:
Analysis of gender normalization using MLP and VTLN features. INTERSPEECH 2010: 306-309
[c47]
- dblp key: conf/interspeech/MetzeBEPSS10
- persistent URL: https://dblp.org/rec/conf/interspeech/MetzeBEPSS10
Florian Metze, Anton Batliner, Florian Eyben, Tim Polzehl, Björn W. Schuller, Stefan Steidl:
Emotion recognition using imperfect speech recognition. INTERSPEECH 2010: 478-481
[c46]
- dblp key: conf/interspeech/HsiaoMS10
- persistent URL: https://dblp.org/rec/conf/interspeech/HsiaoMS10
Roger Hsiao, Florian Metze, Tanja Schultz:
Improvements to generalized discriminative feature transformation for speech recognition. INTERSPEECH 2010: 1361-1364
[c45]
- dblp key: conf/interspeech/MetzeHJNS10
- persistent URL: https://dblp.org/rec/conf/interspeech/MetzeHJNS10
Florian Metze, Roger Hsiao, Qin Jin, Udhyakumar Nallasamy, Tanja Schultz:
The 2010 CMU GALE speech-to-text system. INTERSPEECH 2010: 1501-1504
[c44]
- dblp key: conf/mm/LarsonOMKJ10
- persistent URL: https://dblp.org/rec/conf/mm/LarsonOMKJ10
Martha A. Larson, Roeland Ordelman, Florian Metze, Wessel Kraaij, Franciska de Jong:
Multimedia content with a speech track: ACM multimedia 2010 workshop on searching spontaneous conversational speech. ACM Multimedia 2010: 1747-1748
[c43]
- dblp key: conf/semco/PolzehlMM10
- persistent URL: https://dblp.org/rec/conf/semco/PolzehlMM10
Tim Polzehl, Sebastian Möller, Florian Metze:
Automatically Assessing Personality from Speech. ICSC 2010: 134-140
[c42]
- dblp key: conf/slt/PolzehlMM10
- persistent URL: https://dblp.org/rec/conf/slt/PolzehlMM10
Tim Polzehl, Sebastian Möller, Florian Metze:
Automatically assessing acoustic manifestations of personality in speech. SLT 2010: 7-12
[c41]
- dblp key: conf/trecvid/LiBGOLZYCMH10
- persistent URL: https://dblp.org/rec/conf/trecvid/LiBGOLZYCMH10
Huan Li, Lei Bao, Zan Gao, Arnold Overwijk, Wei Liu, Longfei Zhang, Shoou-I Yu, Ming-yu Chen, Florian Metze, Alexander G. Hauptmann:
Informedia @ TRECVID2010. TRECVID 2010
[e1]
- dblp key: conf/mm/2010sscs
- persistent URL: https://dblp.org/rec/conf/mm/2010sscs
Martha A. Larson, Roeland Ordelman, Florian Metze, Franciska de Jong, Wessel Kraaij:
Proceedings of the 2010 International Workshop on Searching Spontaneous Conversational Speech, SSCS '10, Firenze, Italy, October 29, 2010. ACM 2010, ISBN 978-1-4503-0162-6

2000 – 2009

2009
[j3]
- dblp key: journals/uais/MetzeEBBS09
- persistent URL: https://dblp.org/rec/journals/uais/MetzeEBBS09
Florian Metze, Roman Englert, Udo Bub, Felix Burkhardt, Joachim Stegmann:
Getting closer: tailored human-computer speech dialog. Univers. Access Inf. Soc. 8(2): 97-108 (2009)
[c40]
- dblp key: conf/hci/MetzeWSSM09
- persistent URL: https://dblp.org/rec/conf/hci/MetzeWSSM09
Florian Metze, Ina Wechsung, Stefan Schaffer, Julia Seebode, Sebastian Möller:
Reliable Evaluation of Multimodal Dialogue Systems. HCI (2) 2009: 75-83
[c39]
- dblp key: conf/hci/WechsungESSMM09
- persistent URL: https://dblp.org/rec/conf/hci/WechsungESSMM09
Ina Wechsung, Klaus-Peter Engelbrecht, Stefan Schaffer, Julia Seebode, Florian Metze, Sebastian Möller:
Usability Evaluation of Multimodal Interfaces: Is the Whole the Sum of Its Parts? HCI (2) 2009: 113-119
[c38]
- dblp key: conf/icassp/BurkhardtPSMH09
- persistent URL: https://dblp.org/rec/conf/icassp/BurkhardtPSMH09
Felix Burkhardt, Tim Polzehl, Joachim Stegmann, Florian Metze, Richard Huber:
Detecting real life anger. ICASSP 2009: 4761-4764
[c37]
- dblp key: conf/interspeech/SeebodeSWM09
- persistent URL: https://dblp.org/rec/conf/interspeech/SeebodeSWM09
Julia Seebode, Stefan Schaffer, Ina Wechsung, Florian Metze:
Influence of training on direct and indirect measures for the evaluation of multimodal systems. INTERSPEECH 2009: 300-303
[c36]
- dblp key: conf/interspeech/PolzehlSKWM09
- persistent URL: https://dblp.org/rec/conf/interspeech/PolzehlSKWM09
Tim Polzehl, Shiva Sundaram, Hamed Ketabdar, Michael Wagner, Florian Metze:
Emotion classification in children's speech using fusion of acoustic and linguistic features. INTERSPEECH 2009: 340-343
[c35]
- dblp key: conf/interspeech/WechsungENSSMM09
- persistent URL: https://dblp.org/rec/conf/interspeech/WechsungENSSMM09
Ina Wechsung, Klaus-Peter Engelbrecht, Anja B. Naumann, Stefan Schaffer, Julia Seebode, Florian Metze, Sebastian Möller:
Predicting the quality of multimodal systems based on judgments of single modalities. INTERSPEECH 2009: 1827-1830
[c34]
- dblp key: conf/mc/EnglertM09
- persistent URL: https://dblp.org/rec/conf/mc/EnglertM09
Roman Englert, Florian Metze:
Digital Signage mit Interaktiven Displays. MuC (Workshopband) 2009: 3-5
[c33]
- dblp key: conf/mc/SchafferSWMM09
- persistent URL: https://dblp.org/rec/conf/mc/SchafferSWMM09
Stefan Schaffer, Julia Seebode, Ina Wechsung, Florian Metze, Sebastian Möller:
Benutzerstudien zur Bewertung multimodaler, interaktiver Anzeigetafeln in unterschiedlichen Entwicklungsstufen. MuC (Workshopband) 2009: 22-27
[c32]
- dblp key: conf/mc/WechsungESSMM09
- persistent URL: https://dblp.org/rec/conf/mc/WechsungESSMM09
Ina Wechsung, Klaus-Peter Engelbrecht, Julia Seebode, Stefan Schaffer, Florian Metze, Sebastian Möller:
Usability-Evaluation multimodaler Schnittstellen: Ist das Ganze die Summe seiner Teile? MuC 2009: 495-498
[c31]
- dblp key: conf/semco/MetzePW09
- persistent URL: https://dblp.org/rec/conf/semco/MetzePW09
Florian Metze, Tim Polzehl, Michael Wagner:
Fusion of Acoustic and Linguistic Features for Emotion Detection. ICSC 2009: 153-160
2008
[c30]
- dblp key: conf/iat/WetzkerUHBAM08
- persistent URL: https://dblp.org/rec/conf/iat/WetzkerUHBAM08
Robert Wetzker, Winfried Umbrath, Leonhard Hennig, Christian Bauckhage, Tansu Alpcan, Florian Metze:
Tailoring Taxonomies for Efficient Text Categorization and Expert Finding. Web Intelligence/IAT Workshops 2008: 459-462
[c29]
- dblp key: conf/icpr/WetzkerPKBAM08
- persistent URL: https://dblp.org/rec/conf/icpr/WetzkerPKBAM08
Robert Wetzker, Till Plumbaum, Alexander Korth, Christian Bauckhage, Tansu Alpcan, Florian Metze:
Detecting trends in social bookmarking systems using a probabilistic generative model and smoothing. ICPR 2008: 1-4
[c28]
- dblp key: conf/interspeech/MetzeEBKS08
- persistent URL: https://dblp.org/rec/conf/interspeech/MetzeEBKS08
Florian Metze, Roman Englert, Udo Bub, Ingmar Kliche, Thomas Scheerbarth:
User perception of multi-modal interfaces for mobile applications. INTERSPEECH 2008: 2470-2473
2007
[j2]
- dblp key: journals/speech/Metze07
- persistent URL: https://dblp.org/rec/journals/speech/Metze07
Florian Metze:
Discriminative speaker adaptation using articulatory features. Speech Commun. 49(5): 348-360 (2007)
[c27]
- dblp key: conf/icassp/AjmeraM07
- persistent URL: https://dblp.org/rec/conf/icassp/AjmeraM07
Jitendra Ajmera, Florian Metze:
Spotting using Durational Entropy. ICASSP (4) 2007: 973-976
[c26]
- dblp key: conf/icassp/MetzeAEBBSMHABL07
- persistent URL: https://dblp.org/rec/conf/icassp/MetzeAEBBSMHABL07
Florian Metze, Jitendra Ajmera, Roman Englert, Udo Bub, Felix Burkhardt, Joachim Stegmann, Christian A. Müller, Richard Huber, Bernt Andrassy, Josef G. Bauer, Bernhard Littel:
Comparison of Four Approaches to Age and Gender Recognition for Telephone Applications. ICASSP (4) 2007: 1089-1092
[c25]
- dblp key: conf/naacl/Metze07
- persistent URL: https://dblp.org/rec/conf/naacl/Metze07
Florian Metze:
On using Articulatory Features for Discriminative Speaker Adaptation. HLT-NAACL (Short Papers) 2007: 117-120
[c24]
- dblp key: conf/semco/MetzeBA07
- persistent URL: https://dblp.org/rec/conf/semco/MetzeBA07
Florian Metze, Christian Bauckhage, Tansu Alpcan:
The "Spree" Expert Finding System. ICSC 2007: 551-558
[c23]
- dblp key: conf/smc/BauckhageAAMWIA07
- persistent URL: https://dblp.org/rec/conf/smc/BauckhageAAMWIA07
Christian Bauckhage, Tansu Alpcan, Sachin Agarwal, Florian Metze, Robert Wetzker, Milena Ilic, Sahin Albayrak:
An intelligent knowledge sharing system for web communities. SMC 2007: 3069-3074
2006
[c22]
- dblp key: conf/interspeech/Metze06
- persistent URL: https://dblp.org/rec/conf/interspeech/Metze06
Florian Metze:
Articulatory features for "meeting" speech recognition. INTERSPEECH 2006
2005
[b1]
- dblp key: phd/de/Metze2005
- persistent URL: https://dblp.org/rec/phd/de/Metze2005
Florian Metze:
Articulatory features for conversational speech recognition. Karlsruhe Institute of Technology, Germany, 2005, pp. 1-169
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MetzeFPW05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MetzeFPW05
Florian Metze, Christian Fügen, Yue Pan, Alex Waibel:
Automatically Transcribing Meetings using Distant Microphones. ICASSP (1) 2005: 989-992
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/mlmi/MetzeGHKRWWCRVBCCRABR05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlmi/MetzeGHKRWWCRVBCCRABR05
Florian Metze, Petra Gieselmann, Hartwig Holzapfel, Tobias Kluge, Ivica Rogina, Alex Waibel, Matthias Wölfel, James L. Crowley, Patrick Reignier, Dominique Vaufreydaz, François Bérard, Bérangère Cohen, Joëlle Coutaz, Sylvie Rouillard, Victoria Arranz, Manuel Bertrán, Horacio Rodríguez:
The "FAME" Interactive Space. MLMI 2005: 126-137
2004
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/dagm/KrattMSW04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dagm/KrattMSW04
Jan Kratt, Florian Metze, Rainer Stiefelhagen, Alex Waibel:
Large Vocabulary Audio-Visual Speech Recognition Using the Janus Speech Recognition Toolkit. DAGM-Symposium 2004: 488-495
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SoltauYMFJJ04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SoltauYMFJJ04
Hagen Soltau, Hua Yu, Florian Metze, Christian Fügen, Qin Jin, Szu-Chen Stan Jou:
The 2003 ISL rich transcription system for conversational telephony speech. ICASSP (1) 2004: 773-776
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SchultzJLPMF04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SchultzJLPMF04
Tanja Schultz, Qin Jin, Kornel Laskowski, Yue Pan, Florian Metze, Christian Fügen:
Issues in meeting transcription - the ISL meeting transcription system. INTERSPEECH 2004: 1709-1712
2003
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/StukerSMW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/StukerSMW03
Sebastian Stüker, Tanja Schultz, Florian Metze, Alex Waibel:
Multilingual articulatory features. ICASSP (1) 2003: 144-147
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StukerMSW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StukerMSW03
Sebastian Stüker, Florian Metze, Tanja Schultz, Alex Waibel:
Integrating multilingual articulatory features into speech recognition. INTERSPEECH 2003: 1033-1036
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ManaBCBMMM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ManaBCBMMM03
Nadia Mana, Susanne Burger, Roldano Cattoni, Laurent Besacier, Victoria MacLaren, John W. McDonough, Florian Metze:
The NESPOLE! voIP multilingual corpora in tourism and medical domains. INTERSPEECH 2003: 1589-1592
2002
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LavieMCC02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LavieMCC02
Alon Lavie, Florian Metze, Roldano Cattoni, Erica Costantini:
A Multi-Perspective Evaluation of the NESPOLE! Speech-to-Speech Translation System. Speech-to-Speech Translation@ACL 2002: 121-128
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SoltauMFW02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SoltauMFW02
Hagen Soltau, Florian Metze, Christian Fügen, Alex Waibel:
Efficient language model lookahead through polymorphic linguistic context assignment. ICASSP 2002: 709-712
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SoltauMW02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SoltauMW02
Hagen Soltau, Florian Metze, Alex Waibel:
Compensating for hyperarticulation by modeling articulatory properties. INTERSPEECH 2002: 841-844
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MetzeW02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MetzeW02
Florian Metze, Alex Waibel:
A flexible stream architecture for ASR using articulatory features. INTERSPEECH 2002: 2133-2136
2001
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SoltauSMW01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SoltauSMW01
Hagen Soltau, Thomas Schaaf, Florian Metze, Alex Waibel:
The ISL evaluation system for Verbmobil-II. ICASSP 2001: 65-68
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/McDonoughMSW01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/McDonoughMSW01
John W. McDonough, Florian Metze, Hagen Soltau, Alex Waibel:
Speaker compensation with sine-log all-pass transforms. ICASSP 2001: 369-372
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WaibelBMRSSSYZ01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WaibelBMRSSSYZ01
Alex Waibel, Michael Bett, Florian Metze, Klaus Ries, Thomas Schaaf, Tanja Schultz, Hagen Soltau, Hua Yu, Klaus Zechner:
Advances in automatic meeting record creation and access. ICASSP 2001: 597-600
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BurgerBCMM01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BurgerBCMM01
Susanne Burger, Laurent Besacier, Paolo Coletti, Florian Metze, Céline Morel:
The nespole! voIP dialogue database. INTERSPEECH 2001: 2043-2046
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MetzeMS01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MetzeMS01
Florian Metze, John W. McDonough, Hagen Soltau:
Speech recognition over netmeeting connections. INTERSPEECH 2001: 2389-2392
[c4]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/naacl/WaibelYSPBWSSM01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WaibelYSPBWSSM01
Alex Waibel, Hua Yu, Tanja Schultz, Yue Pan, Michael Bett, Martin Westphal, Hagen Soltau, Thomas Schaaf, Florian Metze:
Advances in meeting recognition. HLT 2001
2000
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/AlbrechtBKMT00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/AlbrechtBKMT00
Sebastian Albrecht, Jan Busch, Martin Kloppenburg, Florian Metze, Paul Tavan:
Generalized radial basis function networks for classification and novelty detection: self-organization of optimal Bayesian decision. Neural Networks 13(10): 1075-1093 (2000)
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MetzeKSSS00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MetzeKSSS00
Florian Metze, Thomas Kemp, Thomas Schaaf, Tanja Schultz, Hagen Soltau:
Confidence measure based language identification. ICASSP 2000: 1827-1830
[c2]
- no documents available
  - details & citations
- export record
  dblp key:
  - conf/konvens/MetzeK00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/konvens/MetzeK00
Florian Metze, Thomas Kemp:
Das View4You-System: End-to-End Evaluation. KONVENS 2000: 273-278

1990 – 1999

1996
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/dexaw/ZborilM96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dexaw/ZborilM96
Daniel Zboril, Florian Metze:
Indeterminateness in Qualitative and Quantitative Reasoning. DEXA Workshop 1996: 262-267
