Jeongsoo Choi

2020 – today

2024
[j2]
Minsu Kim, Jeongsoo Choi, Dahun Kim, Yong Man Ro:
Textless Unit-to-Unit Training for Many-to-Many Multilingual Speech-to-Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3934-3946 (2024)
[j1]
Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro:
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model. IEEE Trans. Multim. 26: 6462-6474 (2024)
[c9]
Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro:
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation. CVPR 2024: 27315-27327
[c8]
Se Jin Park, Minsu Kim, Jeongsoo Choi, Yong Man Ro:
Exploring Phonetic Context-Aware Lip-Sync for Talking Face Generation. ICASSP 2024: 4325-4329
[c7]
Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, Yong Man Ro:
Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-Training and Multi-Modal Tokens. ICASSP 2024: 7970-7974
[c6]
Jeongsoo Choi, Minsu Kim, Se Jin Park, Yong Man Ro:
Text-Driven Talking Face Synthesis by Reprogramming Audio-Driven Models. ICASSP 2024: 8065-8069
[i15]
Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Se Jin Park, Yong Man Ro:
Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units. CoRR abs/2401.09802 (2024)
[i14]
Tan Dat Nguyen, Ji-Hoon Kim, Jeongsoo Choi, Shukjae Choi, Jinseok Park, Younglo Lee, Joon Son Chung:
Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding. CoRR abs/2410.13839 (2024)
[i13]
Zongyi Li, Shujie Hu, Shujie Liu, Long Zhou, Jeongsoo Choi, Lingwei Meng, Xun Guo, Jinyu Li, Hefei Ling, Furu Wei:
ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation. CoRR abs/2410.20502 (2024)
[i12]
Jeongsoo Choi, Ji-Hoon Kim, Jinyu Li, Joon Son Chung, Shujie Liu:
V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow. CoRR abs/2411.19486 (2024)
2023
[c5]
Joanna Hong, Minsu Kim, Jeongsoo Choi, Yong Man Ro:
Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring. CVPR 2023: 18783-18794
[c4]
Jeongsoo Choi, Joanna Hong, Yong Man Ro:
DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding. ICCV 2023: 7778-7787
[c3]
Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Yong Man Ro:
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge. ICCV 2023: 15313-15325
[c2]
Jeongsoo Choi, Minsu Kim, Yong Man Ro:
Intelligible Lip-to-Speech Synthesis with Speech Units. INTERSPEECH 2023: 4349-4353
[i11]
Joanna Hong, Minsu Kim, Jeongsoo Choi, Yong Man Ro:
Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring. CoRR abs/2303.08536 (2023)
[i10]
Se Jin Park, Minsu Kim, Jeongsoo Choi, Yong Man Ro:
Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation. CoRR abs/2305.19556 (2023)
[i9]
Jeongsoo Choi, Minsu Kim, Yong Man Ro:
Intelligible Lip-to-Speech Synthesis with Speech Units. CoRR abs/2305.19603 (2023)
[i8]
Jeongsoo Choi, Minsu Kim, Se Jin Park, Yong Man Ro:
Reprogramming Audio-driven Talking Face Synthesis into Text-driven. CoRR abs/2306.16003 (2023)
[i7]
Minsu Kim, Jeongsoo Choi, Dahun Kim, Yong Man Ro:
Many-to-Many Spoken Language Translation via Unified Speech and Text Representation Learning with Unit-to-Unit Translation. CoRR abs/2308.01831 (2023)
[i6]
Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro:
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model. CoRR abs/2308.07593 (2023)
[i5]
Jeongsoo Choi, Joanna Hong, Yong Man Ro:
DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding. CoRR abs/2308.07787 (2023)
[i4]
Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Yong Man Ro:
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge. CoRR abs/2308.09311 (2023)
[i3]
Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, Yong Man Ro:
Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens. CoRR abs/2309.08531 (2023)
[i2]
Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro:
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation. CoRR abs/2312.02512 (2023)
2022
[c1]
Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro:
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory. AAAI 2022: 2062-2070
[i1]
Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro:
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory. CoRR abs/2211.00924 (2022)