Barry-John Theobald
2020 – today
- 2024
- [c48] Katherine Metcalf, Miguel Sarabia, Masha Fedzechkina, Barry-John Theobald:
  Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It's Complicated. AAAI 2024: 10128-10136
- [c47] Skyler Seto, Barry-John Theobald, Federico Danieli, Navdeep Jaitly, Dan Busbridge:
  REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation. WACV 2024: 2051-2060
- [i26] Jee-weon Jung, Wangyou Zhang, Jiatong Shi, Zakaria Aldeneh, Takuya Higuchi, Barry-John Theobald, Ahmed Hussen Abdelaziz, Shinji Watanabe:
  ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models. CoRR abs/2401.17230 (2024)
- [i25] Zakaria Aldeneh, Takuya Higuchi, Jee-weon Jung, Skyler Seto, Tatiana Likhomanenko, Stephen Shum, Ahmed Hussen Abdelaziz, Shinji Watanabe, Barry-John Theobald:
  Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features? CoRR abs/2402.00340 (2024)
- [i24] Katherine Metcalf, Miguel Sarabia, Natalie Mackraz, Barry-John Theobald:
  Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards. CoRR abs/2402.17975 (2024)
- [i23] Yong Lin, Skyler Seto, Maartje ter Hoeve, Katherine Metcalf, Barry-John Theobald, Xuan Wang, Yizhe Zhang, Chen Huang, Tong Zhang:
  On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization. CoRR abs/2409.03650 (2024)
- [i22] Zakaria Aldeneh, Vimal Thilak, Takuya Higuchi, Barry-John Theobald, Tatiana Likhomanenko:
  Towards Automatic Assessment of Self-Supervised Speech Models using Rank. CoRR abs/2409.10787 (2024)
- [i21] Li-Wei Chen, Takuya Higuchi, He Bai, Ahmed Hussen Abdelaziz, Alexander Rudnicky, Shinji Watanabe, Tatiana Likhomanenko, Barry-John Theobald, Zakaria Aldeneh:
  Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models. CoRR abs/2409.10788 (2024)
- [i20] Zakaria Aldeneh, Takuya Higuchi, Jee-weon Jung, Li-Wei Chen, Stephen Shum, Ahmed Hussen Abdelaziz, Shinji Watanabe, Tatiana Likhomanenko, Barry-John Theobald:
  Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels. CoRR abs/2409.10791 (2024)
- [i19] Bhavika Devnani, Skyler Seto, Zakaria Aldeneh, Alessandro Toso, Elena Menyaylenko, Barry-John Theobald, Jonathan Sheaffer, Miguel Sarabia:
  Learning Spatially-Aware Language and Audio Embedding. CoRR abs/2409.11369 (2024)
- 2023
- [c46] Katherine Metcalf, Miguel Sarabia, Natalie Mackraz, Barry-John Theobald:
  Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards. CoRL 2023: 1484-1532
- [c45] Zakaria Aldeneh, Masha Fedzechkina, Skyler Seto, Katherine Metcalf, Miguel Sarabia, Nicholas Apostoloff, Barry-John Theobald:
  On the Role of LIP Articulation in Visual Speech Perception. ICASSP 2023: 1-5
- [c44] Trisha Mittal, Zakaria Aldeneh, Masha Fedzechkina, Anurag Ranjan, Barry-John Theobald:
  Naturalistic Head Motion Generation from Speech. ICASSP 2023: 1-5
- [c43] Miguel Sarabia, Elena Menyaylenko, Alessandro Toso, Skyler Seto, Zakaria Aldeneh, Shadi Pirhosseinloo, Luca Zappella, Barry-John Theobald, Nicholas Apostoloff, Jonathan Sheaffer:
  Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning. INTERSPEECH 2023: 3724-3728
- [i18] Miguel Sarabia, Elena Menyaylenko, Alessandro Toso, Skyler Seto, Zakaria Aldeneh, Shadi Pirhosseinloo, Luca Zappella, Barry-John Theobald, Nicholas Apostoloff, Jonathan Sheaffer:
  Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning. CoRR abs/2308.09514 (2023)
- [i17] Skyler Seto, Barry-John Theobald, Federico Danieli, Navdeep Jaitly, Dan Busbridge:
  REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation. CoRR abs/2309.03964 (2023)
- 2022
- [i16] Andrew Silva, Katherine Metcalf, Nicholas Apostoloff, Barry-John Theobald:
  FedEmbed: Personalized Private Federated Learning. CoRR abs/2202.09472 (2022)
- [i15] Zakaria Aldeneh, Masha Fedzechkina, Skyler Seto, Katherine Metcalf, Miguel Sarabia, Nicholas Apostoloff, Barry-John Theobald:
  Towards a Perceptual Model for Estimating the Quality of Visual Speech. CoRR abs/2203.10117 (2022)
- [i14] Trisha Mittal, Zakaria Aldeneh, Masha Fedzechkina, Anurag Ranjan, Barry-John Theobald:
  Naturalistic Head Motion Generation from Speech. CoRR abs/2210.14800 (2022)
- [i13] Nico Lingg, Miguel Sarabia, Luca Zappella, Barry-John Theobald:
  Contrastive Self-Supervised Learning for Skeleton Representations. CoRR abs/2211.05304 (2022)
- [i12] Katherine Metcalf, Miguel Sarabia, Barry-John Theobald:
  Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning. CoRR abs/2211.06527 (2022)
- [i11] Akshay Mehra, Skyler Seto, Navdeep Jaitly, Barry-John Theobald:
  Understanding the Robustness of Multi-Exit Models under Common Corruptions. CoRR abs/2212.01562 (2022)
- 2021
- [c42] Nataniel Ruiz, Barry-John Theobald, Anurag Ranjan, Ahmed Hussen Abdelaziz, Nicholas Apostoloff:
  MorphGAN: One-Shot Face Synthesis GAN for Detecting Recognition Bias. BMVC 2021: 348
- [c41] Andrew Silva, Barry-John Theobald, Nicholas Apostoloff:
  Multimodal Punctuation Prediction with Contextual Dropout. ICASSP 2021: 3980-3984
- [c40] Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz:
  On The Role of Visual Cues in Audiovisual Speech Enhancement. ICASSP 2021: 8423-8427
- [i10] Andrew Silva, Barry-John Theobald, Nicholas Apostoloff:
  Multimodal Punctuation Prediction with Contextual Dropout. CoRR abs/2102.11012 (2021)
- 2020
- [c39] Ahmed Hussen Abdelaziz, Barry-John Theobald, Paul Dixon, Reinhard Knothe, Nicholas Apostoloff, Sachin Kajareker:
  Modality Dropout for Improved Performance-driven Talking Faces. ICMI 2020: 378-386
- [i9] Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz:
  Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement. CoRR abs/2004.12031 (2020)
- [i8] Ahmed Hussen Abdelaziz, Barry-John Theobald, Paul Dixon, Reinhard Knothe, Nicholas Apostoloff, Sachin Kajareker:
  Modality Dropout for Improved Performance-driven Talking Faces. CoRR abs/2005.13616 (2020)
- [i7] Nataniel Ruiz, Barry-John Theobald, Anurag Ranjan, Ahmed Hussen Abdelaziz, Nicholas Apostoloff:
  MorphGAN: One-Shot Face Synthesis GAN for Detecting Recognition Bias. CoRR abs/2012.05225 (2020)
2010 – 2019
- 2019
- [c38] Ahmed Hussen Abdelaziz, Barry-John Theobald, Justin Binder, Gabriele Fanelli, Paul Dixon, Nicholas Apostoloff, Thibaut Weise, Sachin Kajareker:
  Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models. ICMI 2019: 220-225
- [c37] Katherine Metcalf, Barry-John Theobald, Garrett Weinberg, Robert Lee, Ing-Marie Jonsson, Russ Webb, Nicholas Apostoloff:
  Mirroring to Build Trust in Digital Assistants. INTERSPEECH 2019: 4000-4004
- [i6] Katherine Metcalf, Barry-John Theobald, Garrett Weinberg, Robert Lee, Ing-Marie Jonsson, Russ Webb, Nicholas Apostoloff:
  Mirroring to Build Trust in Digital Assistants. CoRR abs/1904.01664 (2019)
- [i5] Ahmed Hussen Abdelaziz, Barry-John Theobald, Justin Binder, Gabriele Fanelli, Paul Dixon, Nicholas Apostoloff, Thibaut Weise, Sachin Kajareker:
  Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models. CoRR abs/1905.06860 (2019)
- 2018
- [c36] Katherine Metcalf, Barry-John Theobald, Nicholas Apostoloff:
  Learning Sharing Behaviors with Arbitrary Numbers of Agents. AAMAS 2018: 1232-1240
- [i4] Katherine Metcalf, Barry-John Theobald, Nicholas Apostoloff:
  Learning Sharing Behaviors with Arbitrary Numbers of Agents. CoRR abs/1812.04145 (2018)
- 2017
- [i3] Helen L. Bear, Richard W. Harvey, Barry-John Theobald, Yuxuan Lan:
  Resolution limits on visual speech recognition. CoRR abs/1710.01073 (2017)
- [i2] Helen L. Bear, Gari Owen, Richard W. Harvey, Barry-John Theobald:
  Some observations on computer lip-reading: moving from the dream to the reality. CoRR abs/1710.01084 (2017)
- [i1] Helen L. Bear, Richard W. Harvey, Barry-John Theobald, Yuxuan Lan:
  Which phoneme-to-viseme maps best improve visual-only computer lip-reading? CoRR abs/1710.01093 (2017)
- 2016
- [j7] Felix Shaw, Barry-John Theobald:
  Expressive Modulation of Neutral Visual Speech. IEEE Multim. 23(4): 68-78 (2016)
- [j6] Dominic Howell, Stephen J. Cox, Barry-John Theobald:
  Visual units and confusion modelling for automatic lip-reading. Image Vis. Comput. 51: 1-12 (2016)
- 2015
- [c35] Ausdang Thangthai, Barry-John Theobald:
  HMM-based visual speech synthesis using dynamic visemes. AVSP 2015: 88-92
- [c34] Kwanchiva Thangthai, Richard W. Harvey, Stephen J. Cox, Barry-John Theobald:
  Improving lip-reading performance for robust audiovisual speech recognition using DNNs. AVSP 2015: 127-131
- [c33] Sarah L. Taylor, Barry-John Theobald, Iain A. Matthews:
  A mouth full of words: Visually consistent acoustic redubbing. ICASSP 2015: 4904-4908
- 2014
- [c32] Sarah L. Taylor, Barry-John Theobald, Iain A. Matthews:
  The effect of speaking rate on audio and visual speech. ICASSP 2014: 3037-3041
- [c31] Helen L. Bear, Richard W. Harvey, Barry-John Theobald, Yuxuan Lan:
  Resolution limits on visual speech recognition. ICIP 2014: 1371-1375
- [c30] Helen L. Bear, Richard W. Harvey, Barry-John Theobald, Yuxuan Lan:
  Which Phoneme-to-Viseme Maps Best Improve Visual-Only Computer Lip-Reading? ISVC (2) 2014: 230-239
- 2013
- [c29] Dominic Howell, Barry-John Theobald, Stephen J. Cox:
  Confusion modelling for automated lip-reading using weighted finite-state transducers. AVSP 2013: 197-202
- [c28] Felix Shaw, Barry-John Theobald:
  Transforming neutral visual speech into expressive visual speech. AVSP 2013: 203-208
- 2012
- [j5] Luke M. Davis, Barry-John Theobald, Jason Lines, Andoni Toms, Anthony J. Bagnall:
  On the Segmentation and Classification of Hand Radiographs. Int. J. Neural Syst. 22(5) (2012)
- [j4] Barry-John Theobald, Iain A. Matthews:
  Relating Objective and Subjective Performance Measures for AAM-Based Visual Speech Synthesis. IEEE Trans. Speech Audio Process. 20(8): 2378-2387 (2012)
- [c27] Yuxuan Lan, Richard W. Harvey, Barry-John Theobald:
  Insights into machine lip reading. ICASSP 2012: 4825-4828
- [c26] Yuxuan Lan, Barry-John Theobald, Richard W. Harvey:
  View Independent Computer Lip-Reading. ICME 2012: 432-437
- [c25] Luke M. Davis, Barry-John Theobald, Anthony J. Bagnall:
  Automated Bone Age Assessment Using Feature Extraction. IDEAL 2012: 43-51
- [c24] Sarah L. Taylor, Moshe Mahler, Barry-John Theobald, Iain A. Matthews:
  Dynamic Units of Visual Speech. Symposium on Computer Animation 2012: 275-284
- 2011
- [c23] Luke M. Davis, Barry-John Theobald, Andoni Toms, Anthony J. Bagnall:
  On the Extraction and Classification of Hand Outlines. IDEAL 2011: 92-99
- 2010
- [c22] Jacob L. Newman, Barry-John Theobald, Stephen J. Cox:
  Limitations of visual speech recognition. AVSP 2010: 1
- [c21] Yuxuan Lan, Barry-John Theobald, Richard W. Harvey, Eng-Jon Ong, Richard Bowden:
  Improving visual features for lip-reading. AVSP 2010: 7-3
- [c20] Sarah Hilder, Barry-John Theobald, Richard W. Harvey:
  In pursuit of visemes. AVSP 2010: 8-2
2000 – 2009
- 2009
- [j3] Sascha Fagel, Gérard Bailly, Barry-John Theobald:
  Animating Virtual Speakers or Singers from Audio: Lip-Synching Facial Animation. EURASIP J. Audio Speech Music. Process. 2009 (2009)
- [c19] Sarah Hilder, Richard W. Harvey, Barry-John Theobald:
  Comparison of human and machine-based lip-reading. AVSP 2009: 86-89
- [c18] Yuxuan Lan, Richard W. Harvey, Barry-John Theobald, Eng-Jon Ong, Richard Bowden:
  Comparing visual features for lipreading. AVSP 2009: 102-106
- [c17] Eng-Jon Ong, Yuxuan Lan, Barry-John Theobald, Richard W. Harvey, Richard Bowden:
  Robust facial feature tracking using selected multi-resolution linear predictors. ICCV 2009: 1483-1490
- [c16] Timothy R. Brick, Jeffrey R. Spies, Barry-John Theobald, Iain A. Matthews, Steven M. Boker:
  High-presence, low-bandwidth, apparent 3D video-conferencing with a single camera. WIAMIS 2009: 308-311
- [e1] Barry-John Theobald, Richard W. Harvey:
  Auditory-Visual Speech Processing, AVSP 2009, Norwich, UK, September 10-13, 2009. ISCA 2009
- 2008
- [c15] Barry-John Theobald, Nicholas Wilkinson, Iain A. Matthews:
  On evaluating synthesised visual speech. AVSP 2008: 7-12
- [c14] Stephen J. Cox, Richard W. Harvey, Yuxuan Lan, Jacob L. Newman, Barry-John Theobald:
  The challenge of multispeaker lip-reading. AVSP 2008: 179-184
- [c13] Barry-John Theobald, Nicholas Wilkinson:
  A probabilistic trajectory synthesis system for synthesising visual speech. INTERSPEECH 2008: 1857-1860
- [c12] Barry-John Theobald, Sascha Fagel, Gérard Bailly, Frédéric Elisei:
  LIPS2008: visual speech synthesis challenge. INTERSPEECH 2008: 2310-2313
- [c11] Barry-John Theobald, Gavin C. Cawley, J. Andrew Bangham, Iain A. Matthews, Nicholas Wilkinson:
  Comparing text-driven and speech-driven visual speech synthesisers. INTERSPEECH 2008: 2322
- 2007
- [c10] Barry-John Theobald, Nicholas Wilkinson:
  A real-time speech-driven talking head using active appearance models. AVSP 2007: 22
- [c9] Ahmed Bilal Ashraf, Simon Lucey, Jeffrey F. Cohn, Tsuhan Chen, Zara Ambadar, Kenneth M. Prkachin, Patty Solomon, Barry-John Theobald:
  The painful face: pain expression recognition using active appearance models. ICMI 2007: 9-14
- [c8] Barry-John Theobald, Iain A. Matthews, Jeffrey F. Cohn, Steven M. Boker:
  Real-time expression cloning using appearance models. ICMI 2007: 134-139
- 2006
- [c7] Barry-John Theobald, Iain A. Matthews, Simon Baker:
  Evaluating Error Functions for Robust Active Appearance Models. FGR 2006: 149-154
- 2004
- [j2] Barry-John Theobald, J. Andrew Bangham, Iain A. Matthews, Gavin C. Cawley:
  Near-videorealistic synthetic talking faces: implementation and evaluation. Speech Commun. 44(1-4): 127-140 (2004)
- 2003
- [b1] Barry-John Theobald:
  Visual speech synthesis using shape and appearance models. University of East Anglia, Norwich, UK, 2003
- [j1] Barry-John Theobald, Silko Kruse, J. Andrew Bangham, Gavin C. Cawley:
  Towards a low bandwidth talking face using appearance models. Image Vis. Comput. 21(13-14): 1117-1124 (2003)
- [c6] Barry-John Theobald, J. Andrew Bangham, Iain A. Matthews, Gavin C. Cawley:
  Evaluation of a talking head based on appearance models. AVSP 2003: 187-192
- [c5] Barry-John Theobald, J. Andrew Bangham, Iain A. Matthews, John R. W. Glauert, Gavin C. Cawley:
  2.5D Visual Speech Synthesis Using Appearance Models. BMVC 2003: 1-10
- [c4] Barry-John Theobald, Gavin C. Cawley, Iain A. Matthews, J. Andrew Bangham:
  Near-videorealistic synthetic visual speech using non-rigid appearance models. ICASSP (5) 2003: 800-803
- 2002
- [c3] Barry-John Theobald, J. Andrew Bangham, Iain A. Matthews, Gavin C. Cawley:
  Towards video realistic synthetic visual speech. ICASSP 2002: 3892-3895
- 2001
- [c2] Barry-John Theobald, J. Andrew Bangham, Iain A. Matthews, Gavin C. Cawley:
  Visual speech synthesis using statistical models of shape and appearance. AVSP 2001: 78-83
- [c1] Barry-John Theobald, Gavin C. Cawley, Silko Kruse, J. Andrew Bangham:
  Towards a low bandwidth talking face using appearance models. BMVC 2001: 1-10
last updated on 2024-10-24 21:35 CEST by the dblp team
all metadata released as open data under CC0 1.0 license