Tran Huy Dat
2020 – today
- 2024
- [i2]Tuan Duy Nguyen Le, Kah Kuan Teh, Huy Dat Tran:
Continuous Learning of Transformer-based Audio Deepfake Detection. CoRR abs/2409.05924 (2024)
- 2021
- [c49]Kah Kuan Teh, Huy Dat Tran:
Open-Set Audio Classification with Limited Training Resources Based on Augmentation Enhanced Variational Auto-Encoder GAN with Detection-Classification Joint Training. Interspeech 2021: 4169-4173
2010 – 2019
- 2019
- [c48]Kah Kuan Teh, Tran Huy Dat:
Embedding Physical Augmentation and Wavelet Scattering Transform to Generative Adversarial Networks for Audio Classification with Limited Training Resources. ICASSP 2019: 3262-3266
- [c47]Tze Yuang Chong, Kye Min Tan, Kah Kuan Teh, Chang Huai You, Hanwu Sun, Tran Huy Dat:
The I2R's ASR System for the VOiCES from a Distance Challenge 2019. INTERSPEECH 2019: 2458-2462
- [c46]Tze Yuang Chong, Kye Min Tan, Kah Kuan Teh, Chang Huai You, Hanwu Sun, Tran Huy Dat:
The I2R's ASR System for the VOiCES from a Distance Challenge 2019. INTERSPEECH 2019
- [c45]Hanwu Sun, Kah Kuan Teh, Ivan Kukanov, Tran Huy Dat:
The I2R's Submission to VOiCES Distance Speaker Recognition Challenge 2019. INTERSPEECH 2019: 2478-2482
- [c44]Chang Huai You, Jichen Yang, Huy Dat Tran:
Device Feature Extractor for Replay Spoofing Detection. INTERSPEECH 2019: 2933-2937
- [c43]Kangkang Lu, Chuan-Sheng Foo, Kah Kuan Teh, Huy Dat Tran, Vijay Ramaseshan Chandrasekhar:
Semi-Supervised Audio Classification with Consistency-Based Regularization. INTERSPEECH 2019: 3654-3658
- [i1]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019)
- 2017
- [c42]Tin Lay Nwe, Tran Huy Dat, Bin Ma:
Convolutional neural network with multi-task learning scheme for acoustic scene classification. APSIPA 2017: 1347-1350
- [c41]Tin Lay Nwe, Tran Huy Dat, Wen Zheng Terence Ng, Bin Ma:
An Integrated Solution for Snoring Sound Classification Using Bhattacharyya Distance Based GMM Supervectors with SVM, Feature Selection with Random Forest and Spectrogram with CNN. INTERSPEECH 2017: 3467-3471
- [c40]Tran Huy Dat, Wen Zheng Terence Ng, Yi Ren Leng:
Data Augmentation, Missing Feature Mask and Kernel Classification for Through-the-Wall Acoustic Surveillance. INTERSPEECH 2017: 3807-3811
- 2016
- [c39]Tran Huy Dat, Jonathan William Dennis, Yi Ren Leng, Wen Zheng Terence Ng:
A comparative study of multi-channel processing methods for noisy automatic speech recognition in urban environments. ICASSP 2016: 6465-6469
- 2015
- [j10]Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Generalized Hough Transform for Speech Pattern Classification. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 1963-1972 (2015)
- [c38]Jonathan William Dennis, Tran Huy Dat:
Single and multi-channel approaches for distant speech recognition under noisy reverberant conditions: I2R'S system description for the ASpIRE challenge. ASRU 2015: 518-524
- [c37]Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Combining robust spike coding with spiking neural networks for sound event classification. ICASSP 2015: 176-180
- [c36]Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Spiking neural networks and the generalised hough transform for speech pattern detection. INTERSPEECH 2015: 1997-2001
- [c35]Huy Dat Tran, Jonathan William Dennis, Wen Zheng Terence Ng:
The I2R ASR system for IWSLT 2015. IWSLT (Evaluation Campaign) 2015
- [c34]Tran Huy Dat, Jonathan William Dennis, Wen Zheng Terence Ng:
Missing Feature Kernel and Nonparametric Window Subband Power Distribution for Robust Sound Event Classification. SPECOM 2015: 277-284
- 2014
- [c33]Jonathan William Dennis, Tran Huy Dat:
Enhanced local feature approach for overlapping sound event recognition. APSIPA 2014: 1-4
- [c32]Yi Ren Leng, Tran Huy Dat:
Multi-label bird classification using an ensemble classifier with simple features. APSIPA 2014: 1-5
- [c31]Yi Ren Leng, Jonathan William Dennis, Tran Huy Dat:
Bird Classification using Ensemble Classifiers. CLEF (Working Notes) 2014: 654-661
- [c30]Jonathan William Dennis, Tran Huy Dat, Haizhou Li, Engsiong Chng:
A discriminatively trained Hough Transform for frame-level phoneme recognition. ICASSP 2014: 2514-2518
- [c29]Tran Huy Dat, Wen Zheng Terence Ng, Jonathan William Dennis, Yi Ren Leng:
Generalized Gaussian Distribution Kullback-Leibler kernel for robust sound event recognition. ICASSP 2014: 5949-5953
- [c28]Jonathan William Dennis, Tran Huy Dat, Chng Eng Siong:
Analysis of spectrogram image methods for sound event classification. INTERSPEECH 2014: 2533-2537
- 2013
- [j9]Jonathan William Dennis, Tran Huy Dat, Engsiong Chng:
Overlapping sound event recognition using local spectrogram features and the generalised hough transform. Pattern Recognit. Lett. 34(9): 1085-1093 (2013)
- [j8]Jonathan William Dennis, Tran Huy Dat, Engsiong Chng:
Image Feature Representation of the Subband Power Distribution for Robust Sound Event Classification. IEEE Trans. Speech Audio Process. 21(2): 367-377 (2013)
- [c27]Wen Zheng Terence Ng, Tran Huy Dat, Jonathan William Dennis, Chng Eng Siong:
A robust sound event recognition framework under TV playing conditions. APSIPA 2013: 1-5
- [c26]Wen Zheng Terence Ng, Tran Huy Dat, Huynh Thai Hoa, Chng Eng Siong:
Adaptive semi-supervised tree SVM for sound event recognition in home environments. APSIPA 2013: 1-4
- [c25]Wen Zheng Terence Ng, Tran Huy Dat, Jonathan William Dennis, Chng Eng Siong:
Robust sound event recognition under TV playing conditions. ChinaSIP 2013: 332-336
- [c24]Jonathan William Dennis, Qiang Yu, Huajin Tang, Tran Huy Dat, Haizhou Li:
Temporal coding of local spectrogram features for robust sound recognition. ICASSP 2013: 803-807
- [c23]Yeow Kee Tan, Alvin Hong Yee Wong, Chern Yuen Anthony Wong, Tran Anh Dung, Adrian Hwang Jian Tay, Dilip Kumar Limbu, Tran Huy Dat, Weng Zheng Ng, Rui Yan, Benedict Tay Tiong Chee:
Evaluation of the Pet Robot CuDDler Using Godspeed Questionnaire. ICOST 2013: 102-109
- [c22]Dilip Kumar Limbu, Chern Yuen Anthony Wong, Adrian Hwang Jian Tay, Tran Anh Dung, Yeow Kee Tan, Tran Huy Dat, Alvin Hong Yee Wong, Wen Zheng Terence Ng, Ridong Jiang, Jun Li:
Affective social interaction with CuDDler robot. RAM 2013: 179-184
- 2012
- [j7]Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li:
Selective Gammatone Envelope Feature for Robust Sound Event Recognition. IEICE Trans. Inf. Syst. 95-D(5): 1229-1237 (2012)
- [c21]Jonathan William Dennis, Tran Huy Dat, Engsiong Chng:
Overlapping Sound Event Recognition using Local Spectrogram Features with the Generalised Hough Transform. INTERSPEECH 2012: 2266-2269
- [c20]Yi Ren Leng, Tran Huy Dat:
Using Blob Detection in Missing Feature Linear-Frequency Cepstral Coefficients for Robust Sound Event Recognition. INTERSPEECH 2012: 2506-2509
- 2011
- [j6]Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions. IEEE Signal Process. Lett. 18(2): 130-133 (2011)
- [c19]Tran Huy Dat, Haizhou Li:
Probabilistic distance SVM with Hellinger-Exponential Kernel for sound event classification. ICASSP 2011: 2272-2275
- [c18]Tran Huy Dat, Haizhou Li:
Jump Function Kolmogorov for overlapping audio event classification. ICASSP 2011: 3696-3699
- [c17]Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li:
Alternative Frequency Scale Cepstral Coefficient for Robust Sound Event Recognition. INTERSPEECH 2011: 297-300
- [c16]Huynh Thai Hoa, An Vu Tran, Tran Huy Dat:
Semi-Supervised Tree Support Vector Machine for Online Cough Recognition. INTERSPEECH 2011: 1637-1640
- [c15]Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Image Representation of the Subband Power Distribution for Robust Sound Classification. INTERSPEECH 2011: 2437-2440
- 2010
- [c14]Tran Huy Dat, Yi Ren Leng, Haizhou Li:
Feature integration for heart sound biometrics. ICASSP 2010: 1714-1717
- [c13]Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li:
Selective gammatone filterbank feature for robust sound event recognition. INTERSPEECH 2010: 2246-2249
2000 – 2009
- 2009
- [j5]Tran Huy Dat, Haizhou Li:
Jump function Kolmogorov for audio classification in noise-mismatch conditions. IEEE Trans. Signal Process. 57(8): 2908-2918 (2009)
- [c12]Tran Huy Dat, Haizhou Li:
Sound event classification based on Feature Integration, Recursive Feature Elimination and Structured Classification. ICASSP 2009: 177-180
- 2008
- [j4]Tran Huy Dat, Kazuya Takeda, Fumitada Itakura:
Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation. IEICE Trans. Inf. Syst. 91-D(3): 439-447 (2008)
- [j3]Koksoon Phua, Jianfeng Chen, Tran Huy Dat, Louis Shue:
Heart sound as a biometric. Pattern Recognit. 41(3): 906-919 (2008)
- [c11]Tran Huy Dat, Haizhou Li:
Jump function Kolmogorov and its application for audio stream segmentation and classification. ICASSP 2008: 3353-3356
- [c10]Tran Huy Dat, Haizhou Li:
Speaker identification in noise mismatch conditions based on jump function Kolmogorov analysis in wavelet domain. INTERSPEECH 2008: 1469-1472
- 2007
- [c9]Tran Huy Dat, Cuntai Guan:
Feature Selection Based on Fisher Ratio and Mutual Information Analyses for Robust Brain Computer Interface. ICASSP (1) 2007: 337-340
- [c8]Jianfeng Chen, Tran Huy Dat, Koksoon Phua, Jit Biswas, Maniyeri Jayachandran:
Using Keyword Spotting and Replacement for Speech Anonymization. ICME 2007: 548-551
- 2006
- [j2]Tran Huy Dat, Kazuya Takeda, Fumitada Itakura:
Gamma Modeling of Speech Power and Its On-Line Estimation for Statistical Speech Enhancement. IEICE Trans. Inf. Syst. 89-D(3): 1040-1049 (2006)
- [j1]Tran Huy Dat, Kazuya Takeda, Fumitada Itakura:
On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement. Speech Commun. 48(11): 1515-1527 (2006)
- [c7]Tran Huy Dat, Louis Shue, Cuntai Guan:
Electrocorticographic signal classification based on time-frequency decomposition and nonparametric statistical modeling. EMBC 2006: 2292-2295
- [c6]Tran Huy Dat, Kazuya Takeda, Fumitada Itakura:
Multichannel Speech Enhancement Based on Speech Spectral Magnitude Estimation Using Generalized Gamma Prior Distribution. ICASSP (4) 2006: 1149-1152
- 2005
- [c5]Tran Huy Dat, Kazuya Takeda, Fumitada Itakura:
Generalized gamma modeling of speech and its online estimation for speech enhancement. ICASSP (4) 2005: 181-184
- [c4]Kazuya Takeda, Tran Huy Dat, Hiroshi Fujimura, Fumitada Itakura:
SNR and Local Noise Power Estimations Based on Gaussian Mixture Modeling on the Log-Power Domain. ICASSP (1) 2005: 881-884
- [c3]Tran Huy Dat, Kazuya Takeda, Fumitada Itakura:
A speech enhancement system based on data clustering and cumulative histogram equalization. ICDE Workshops 2005: 1207
- [c2]Tran Huy Dat, Kazuya Takeda, Fumitada Itakura:
Maximum a Posterior Probability and Cumulative Distribution Function Equalization Methods for Speech Spectral Estimation with Application in Noise Suppression Filtering. NOLISP 2005: 328-337
- 2004
- [c1]Weifeng Li, Kazuya Takeda, Fumitada Itakura, Tran Huy Dat:
Speech enhancement based on magnitude estimation using the gamma prior. INTERSPEECH 2004: 2693-2696
last updated on 2024-10-10 22:14 CEST by the dblp team
all metadata released as open data under CC0 1.0 license