default search action
Jindrich Zdánský
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c49]Martin Polácek, Petr Cerva, Jindrich Zdánský, Lenka Weingartová:
Online Punctuation Restoration using ELECTRA Model for streaming ASR Systems. INTERSPEECH 2023: 446-450 - [c48]Lukás Mateju, Jan Nouza, Petr Cerva, Jindrich Zdánský, Frantisek Kynych:
Combining Multilingual Resources and Models to Develop State-of-the-Art E2E ASR for Swedish. INTERSPEECH 2023: 3252-3256 - [c47]Frantisek Kynych, Jindrich Zdánský, Petr Cerva, Lukás Mateju:
Online Speaker Diarization Using Optimized SE-ResNet Architecture. TSD 2023: 176-187 - [c46]Jan Nouza, Lukás Mateju, Petr Cerva, Jindrich Zdánský:
Developing State-of-the-Art End-to-End ASR for Norwegian. TSD 2023: 200-213 - 2022
- [j4]Jirí Málek, Jakub Janský, Zbynek Koldovský, Tomás Kounovský, Jaroslav Cmejla, Jindrich Zdánský:
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2295-2309 (2022) - [c45]Lukás Mateju, Frantisek Kynych, Petr Cerva, Jirí Málek, Jindrich Zdánský:
Overlapped Speech Detection in Broadcast Streams Using X-vectors. INTERSPEECH 2022: 4606-4610 - [c44]Jan Nouza, Petr Cerva, Jindrich Zdánský:
Lexicon-based vs. Lexicon-free ASR for Norwegian Parliament Speech Transcription. TSD 2022: 401-409 - 2021
- [j3]Petr Cerva, Lukás Mateju, Jindrich Zdánský, Radek Safarík, Jan Nouza:
Identification of related languages from spoken data: Moving from off-line to on-line scenario. Comput. Speech Lang. 68: 101180 (2021) - [c43]Jirí Málek, Jakub Janský, Tomás Kounovský, Zbynek Koldovský, Jindrich Zdánský:
Blind Extraction of Moving Audio Source in a Challenging Environment Supported by Speaker Identification Via X-Vectors. ICASSP 2021: 226-230 - [c42]Lukás Mateju, Frantisek Kynych, Petr Cerva, Jindrich Zdánský, Jirí Málek:
Using X-Vectors for Speech Activity Detection in Broadcast Streams. Interspeech 2021: 1474-1478 - [c41]Petr Cerva, Lukás Mateju, Frantisek Kynych, Jindrich Zdánský, Jan Nouza:
Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-Vectors. TDS 2021: 371-381 - [i3]Jirí Málek, Jakub Janský, Zbynek Koldovský, Tomás Kounovský, Jaroslav Cmejla, Jindrich Zdánský:
Blind Extraction of Target Speech Source Guided by Supervised Speaker Identification via X-vectors. CoRR abs/2111.03482 (2021) - 2020
- [c40]Josef Chaloupka, Karel Palecek, Petr Cerva, Jindrich Zdánský:
Optical Character Recognition for Audio-Visual Broadcast Transcription System. CogInfoCom 2020: 229-232 - [c39]Jakub Janský, Jirí Málek, Jaroslav Cmejla, Tomás Kounovský, Zbynek Koldovský, Jindrich Zdánský:
Adaptive Blind Audio Source Extraction Supervised By Dominant Speaker Identification Using X-Vectors. ICASSP 2020: 676-680 - [c38]Jirí Málek, Jindrich Zdánský:
Voice-Activity and Overlapped Speech Detection Using x-Vectors. TDS 2020: 366-376 - [c37]Jan Nouza, Petr Cerva, Jindrich Zdánský:
Very Fast Keyword Spotting System with Real Time Factor Below 0.01. TDS 2020: 426-436 - [i2]Jan Nouza, Petr Cerva, Jindrich Zdánský:
Very Fast Keyword Spotting System with Real Time Factor below 0.01. CoRR abs/2007.10706 (2020)
2010 – 2019
- 2019
- [c36]Lukás Mateju, Petr Cerva, Jindrich Zdánský:
An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs. INTERSPEECH 2019: 649-653 - [c35]Jirí Málek, Jindrich Zdánský:
On Practical Aspects of Multi-condition Training Based on Augmentation for Reverberation-/Noise-Robust Speech Recognition. TSD 2019: 251-263 - [i1]Jakub Janský, Jirí Málek, Jaroslav Cmejla, Tomás Kounovský, Zbynek Koldovský, Jindrich Zdánský:
Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors. CoRR abs/1910.11824 (2019) - 2018
- [c34]Jirí Málek, Jindrich Zdánský, Petr Cerva:
Robust Recognition of Speech with Background Music in Acoustically Under-Resourced Scenarios. ICASSP 2018: 5624-5628 - [c33]Lukás Mateju, Petr Cerva, Jindrich Zdánský, Radek Safarík:
Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal. INTERSPEECH 2018: 1803-1807 - [c32]Jirí Málek, Jindrich Zdánský, Petr Cerva:
Robust Recognition of Conversational Telephone Speech via Multi-condition Training and Data Augmentation. TSD 2018: 324-333 - 2017
- [c31]Jirí Málek, Jindrich Zdánský, Petr Cerva:
Robust Automatic Recognition of Speech with background music. ICASSP 2017: 5210-5214 - [c30]Lukás Mateju, Petr Cerva, Jindrich Zdánský, Jirí Málek:
Speech Activity Detection in online broadcast transcription using Deep Neural Networks and Weighted Finite State Transducers. ICASSP 2017: 5460-5464 - 2016
- [c29]Lukás Mateju, Petr Cerva, Jindrich Zdánský:
Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription. ICETE (Selected Papers) 2016: 341-358 - [c28]Lukás Mateju, Petr Cerva, Jindrich Zdánský:
Study on the Use of Deep Neural Networks for Speech Activity Detection in Broadcast Recordings. SIGMAP 2016: 45-51 - 2015
- [c27]Jirí Málek, Jan Silovský, Petr Cerva, Zbynek Koldovský, Jan Nouza, Jindrich Zdánský:
Compensation of nonlinear distortions in speech for automatic recognition. TSP 2015: 1-5 - 2014
- [c26]Jan Nouza, Petr Cerva, Jindrich Zdánský, Karel Blavka, Marek Bohac, Jan Silovský, Josef Chaloupka, Michaela Kucharová, Ladislav Seps, Jirí Málek, Michal Rott:
Speech-to-text technology to transcribe and disclose 100, 000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive. INTERSPEECH 2014: 964-968 - 2013
- [j2]Petr Cerva, Jan Silovský, Jindrich Zdánský, Jan Nouza, Ladislav Seps:
Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives. Speech Commun. 55(10): 1033-1046 (2013) - 2012
- [j1]Jan Nouza, Karel Blavka, Petr Cerva, Jindrich Zdánský, Jan Silovský, Marek Bohac, Jan Prazak:
Making Czech Historical Radio Archive Accessible and Searchable for Wide Public. J. Multim. 7(2): 159-169 (2012) - [c25]Jan Silovský, Petr Cerva, Jindrich Zdánský, Jan Nouza:
Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription. INTERSPEECH 2012: 478-481 - [c24]Petr Cerva, Jan Silovský, Jindrich Zdánský, Jan Nouza, Jirí Málek:
Real-Time Lecture Transcription using ASR for Czech Hearing Impaired or Deaf Students. INTERSPEECH 2012: 763-766 - [c23]Jan Silovský, Jindrich Zdánský, Jan Nouza, Petr Cerva, Jan Prazak:
Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams. MMSP 2012: 118-123 - [c22]Petr Cerva, Jan Silovský, Jindrich Zdánský, Ondrej Smola, Karel Blavka, Karel Palecek, Jan Nouza, Jirí Málek:
Browsing, indexing and automatic transcription of lectures for distance learning. MMSP 2012: 198-202 - [c21]Jan Nouza, Karel Blavka, Jindrich Zdánský, Petr Cerva, Jan Silovský, Marek Bohac, Josef Chaloupka, Michaela Kucharová, Ladislav Seps:
Large-scale processing, indexing and search system for Czech audio-visual cultural heritage archives. MMSP 2012: 337-342 - 2011
- [c20]Jan Silovský, Jan Prazak, Petr Cerva, Jindrich Zdánský, Jan Nouza:
PLDA-Based Clustering for Speaker Diarization of Broadcast Streams. INTERSPEECH 2011: 2909-2912 - [c19]Jan Nouza, Karel Blavka, Marek Bohac, Petr Cerva, Jindrich Zdánský, Jan Silovský, Jan Prazak:
Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio. MM4CH 2011: 27-38
2000 – 2009
- 2009
- [c18]Jan Nouza, Jindrich Zdánský, Petr Cerva, Jan Silovský:
Challenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak). COST 2102 Training School 2009: 225-241 - [c17]Jan Nouza, Petr Cerva, Jindrich Zdánský:
Very large vocabulary voice dictation for mobile devices. INTERSPEECH 2009: 995-998 - 2008
- [c16]Josef Chaloupka, Jan Nouza, Jindrich Zdánský:
Audio-visual voice command recognition in noisy conditions. AVSP 2008: 25-30 - [c15]Josef Chaloupka, Jan Nouza, Jindrich Zdánský, Petr Cerva, Jan Silovský, Martin Kroul:
Voice Technology Applied for Building a Prototype Smart Room. COST 2102 School (Vietri) 2008: 104-111 - [c14]Jan Silovský, Petr Cerva, Jindrich Zdánský:
MLLR Transforms Based Speaker Recognition in Broadcast Streams. COST 2102 Conference (Prague) 2008: 423-431 - [c13]Jirí Málek, Zbynek Koldovský, Jindrich Zdánský, Jan Nouza:
Enhancement of noisy speech recordings via blind source separation. INTERSPEECH 2008: 159-162 - [c12]Jan Nouza, Jan Silovský, Jindrich Zdánský, Petr Cerva, Martin Kroul, Josef Chaloupka:
Czech-to-slovak adapted broadcast news transcription system. INTERSPEECH 2008: 2683-2686 - [c11]Jindrich Zdánský, Josef Chaloupka, Jan Nouza:
Joint audio-visual processing, representation and indexing of TV news programmes. MMSP 2008: 960-965 - [c10]Petr Cerva, Jindrich Zdánský, Jan Silovský, Jan Nouza:
Study on Speaker Adaptation Methods in the Broadcast News Transcription Task. TSD 2008: 277-284 - 2006
- [c9]Jan Nouza, Jindrich Zdánský, Petr Cerva, Jan Kolorenc:
Continual on-line monitoring of Czech spoken broadcast programs. INTERSPEECH 2006 - [c8]Jindrich Zdánský:
BINSEG: an efficient speaker-based segmentation technique. INTERSPEECH 2006 - [c7]Jan Nouza, Jindrich Zdánský, Petr Cerva, Jan Kolorenc:
A System for Information Retrieval from Large Records of Czech Spoken Data. TSD 2006: 485-492 - 2005
- [c6]Janez Zibert, France Mihelic, Jean-Pierre Martens, Hugo Meinedo, João Paulo Neto, Laura Docío Fernández, Carmen García-Mateo, Petr David, Jindrich Zdánský, Matús Pleva, Anton Cizmar, Andrej Zgank, Zdravko Kacic, Csaba Teleki, Klára Vicsi:
The COST278 broadcast news segmentation and speaker clustering evaluation - overview, methodology, systems, results. INTERSPEECH 2005: 629-632 - [c5]Jindrich Zdánský, Jan Nouza:
Detection of acoustic change-points in audio records via global BIC maximization and dynamic programming. INTERSPEECH 2005: 669-672 - [c4]Jan Nouza, Jindrich Zdánský, Petr David, Petr Cerva, Jan Kolorenc, Dana Nejedlová:
Fully automated system for Czech spoken broadcast transcription with very large (300k+) lexicon. INTERSPEECH 2005: 1681-1684 - 2004
- [c3]Jan Nouza, Dana Nejedlová, Jindrich Zdánský, Jan Kolorenc:
Very large vocabulary speech recognition system for automatic transcription of czech broadcast programs. INTERSPEECH 2004: 409-412 - [c2]Jindrich Zdánský, Petr David, Jan Nouza:
An improved preprocessor for the automatic transcription of broadcast news audio stream. INTERSPEECH 2004: 1065-1068 - [c1]Jan Nouza, Jindrich Zdánský, Petr David:
Fully Automated Approach to Broadcast News Transcription in Czech Language. TSD 2004: 401-408
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:16 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint