default search action

combined dblp search
author search
venue search
publication search

ask others

Kazuki Shimada

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/TakidaI0SCLMUUL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/TakidaI0SCLMUUL24
Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji:
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes. Trans. Mach. Learn. Res. 2024 (2024)
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/emccompo/ShimadaO24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emccompo/ShimadaO24
Kazuki Shimada, Mototsugu Okushima:
On-Chip ESD Current Sensor for Nanosecond Oscillation Waveform Over Ampere Detecting. EMC Compo 2024: 46-50
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShimadaUK0TMK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShimadaUK0TMK24
Kazuki Shimada, Kengo Uchida, Yuichiro Koyama, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji, Tatsuya Kawahara:
Zero- and Few-Shot Sound Event Localization and Detection. ICASSP 2024: 636-640
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShiSH0KZTKM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiSH0KZTKM24
Hao Shi, Kazuki Shimada, Masato Hirano, Takashi Shibuya, Yuichiro Koyama, Zhi Zhong, Shusuke Takahashi, Tatsuya Kawahara, Yuki Mitsufuji:
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders. ICASSP 2024: 12951-12955
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-00365
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-00365
Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji:
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes. CoRR abs/2401.00365 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-01135
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-01135
Wei-Hsiang Liao, Yuhta Takida, Yukara Ikemiya, Zhi Zhong, Chieh-Hsin Lai, Giorgio Fabbro, Kazuki Shimada, Keisuke Toyama, Kin Wai Cheuk, Marco A. Martínez Ramírez, Shusuke Takahashi, Stefan Uhlich, Taketo Akama, Woosung Choi, Yuichiro Koyama, Yuki Mitsufuji:
Music Foundation Model as Generic Booster for Music Downstream Tasks. CoRR abs/2411.01135 (2024)
2023
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhongHSTTM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhongHSTTM23
Zhi Zhong, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Shusuke Takahashi, Yuki Mitsufuji:
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification. ICASSP 2023: 1-5
[c11]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ShimadaPS0UAHKT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ShimadaPS0UAHKT23
Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Aleksander Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. NeurIPS 2023
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/ZhongSHSTSTM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/ZhongSHSTSTM23
Zhi Zhong, Hao Shi, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji:
Extending Audio Masked Autoencoders toward Audio Restoration. WASPAA 2023: 1-5
[d4]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisSSHTKTAKUMV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisSSHTKTAKUMV23
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji, Tuomas Virtanen:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.0.0. Zenodo, 2023 [all versions]
[d3]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisSSHTKTAKUMV23a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisSSHTKTAKUMV23a
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji, Tuomas Virtanen:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.1.0. Zenodo, 2023 [all versions]
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08136
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08136
Zhi Zhong, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Shusuke Takahashi, Yuki Mitsufuji:
An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification. CoRR abs/2302.08136 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-05857
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-05857
Masato Hirano, Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji:
Diffusion-based Signal Refiner for Speech Separation. CoRR abs/2305.05857 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-06701
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-06701
Zhi Zhong, Hao Shi, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji:
Extending Audio Masked Autoencoders Toward Audio Restoration. CoRR abs/2305.06701 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-10734
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10734
Hao Shi, Kazuki Shimada, Masato Hirano, Takashi Shibuya, Yuichiro Koyama, Zhi Zhong, Shusuke Takahashi, Tatsuya Kawahara, Yuki Mitsufuji:
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders. CoRR abs/2305.10734 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-09126
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09126
Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. CoRR abs/2306.09126 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-09223
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09223
Kazuki Shimada, Kengo Uchida, Yuichiro Koyama, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji, Tatsuya Kawahara:
Zero- and Few-shot Sound Event Localization and Detection. CoRR abs/2309.09223 (2023)
2022
[c9]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/PolitisSSA0KTTM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/PolitisSSA0KTTM22
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. DCASE 2022
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShimadaKTTTM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShimadaKTTTM22
Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji:
Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training. ICASSP 2022: 316-320
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PerezSKTM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PerezSKTM22
Ricardo Falcón Pérez, Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji:
Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection. ICASSP 2022: 431-435
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KoyamaSTSTTTM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoyamaSTSTTTM22
Yuichiro Koyama, Kazuhide Shigemi, Masafumi Takahashi, Kazuki Shimada, Naoya Takahashi, Emiru Tsunoo, Shusuke Takahashi, Yuki Mitsufuji:
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection. ICASSP 2022: 8872-8876
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/ismar/ShimadaSSKK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismar/ShimadaSSKK22
Kazuki Shimada, Taishi Sawabe, Hidehiko Shishido, Masayuki Kanbara, Itaru Kitahara:
Video Generation Unconsciously Evoking Pre-Motion to Passengers in Automated Vehicles. ISMAR Adjunct 2022: 342-347
[d2]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisMSSAKKTTV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisMSSAKKTTV22
Adavanne Politis, Yuki Mitsufuji, Parthasaarathy Sudarsanam, Kazuki Shimada, Sharath Adavanne, Yuichiro Koyama, Daniel Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.0.0. Zenodo, 2022 [all versions]
[d1]
- view
  authority control:
- export record
  dblp key:
  - data/10/PolitisMSSAKKTTV22a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisMSSAKKTTV22a
Archontis Politis, Yuki Mitsufuji, Parthasaarathy Sudarsanam, Kazuki Shimada, Sharath Adavanne, Yuichiro Koyama, Daniel Aleksander Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.1.0. Zenodo, 2022 [all versions]
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-01948
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-01948
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events. CoRR abs/2206.01948 (2022)
2021
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShimadaKTTM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShimadaKTTM21
Kazuki Shimada, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji:
Accdoa: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection. ICASSP 2021: 915-919
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-10806
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-10806
Kazuki Shimada, Naoya Takahashi, Yuichiro Koyama, Shusuke Takahashi, Emiru Tsunoo, Masafumi Takahashi, Yuki Mitsufuji:
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection. CoRR abs/2106.10806 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06126
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06126
Ricardo Falcón Pérez, Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji:
Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection. CoRR abs/2110.06126 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06501
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06501
Yuichiro Koyama, Kazuhide Shigemi, Masafumi Takahashi, Kazuki Shimada, Naoya Takahashi, Emiru Tsunoo, Shusuke Takahashi, Yuki Mitsufuji:
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection. CoRR abs/2110.06501 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-07124
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-07124
Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji:
Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training. CoRR abs/2110.07124 (2021)
2020
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShimadaKI20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShimadaKI20
Kazuki Shimada, Yuichiro Koyama, Akira Inoue:
Metric Learning with Background Noise Class for Few-Shot Detection of Rare Sound Events. ICASSP 2020: 616-620
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-12014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-12014
Kazuki Shimada, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji:
Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net. CoRR abs/2006.12014 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-15306
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15306
Kazuki Shimada, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji:
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection. CoRR abs/2010.15306 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ShimadaBMIYK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ShimadaBMIYK19
Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara:
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(5): 960-971 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-09341
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-09341
Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara:
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition. CoRR abs/1903.09341 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-13724
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-13724
Kazuki Shimada, Yuichiro Koyama, Akira Inoue:
Metric Learning with Background Noise Class for Few-shot Detection of Rare Sound Events. CoRR abs/1910.13724 (2019)
2018
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShimadaBMIYK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShimadaBMIYK18
Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara:
Unsupervised Beamforming Based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition. ICASSP 2018: 5734-5738
2017
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MimuraBSSYK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MimuraBSSYK17
Masato Mimura, Yoshiaki Bando, Kazuki Shimada, Shinsuke Sakai, Kazuyoshi Yoshii, Tatsuya Kawahara:
Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition. INTERSPEECH 2017: 2451-2455

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.