Exploring Optimal DNN Architecture for End-to-End Beamformers Based on Time-frequency References

Koyama, Yuichiro; Raj, Bhiksha

arXiv:2005.12683 (eess)

[Submitted on 23 May 2020 (v1), last revised 11 Aug 2020 (this version, v2)]

Abstract: Acoustic beamformers have been widely used to enhance audio signals. Currently, the best methods are the deep neural network (DNN)-powered variants of the generalized eigenvalue and minimum-variance distortionless response beamformers, and the DNN-based filter-estimation methods that directly compute beamforming filters. Both approaches are effective; however, each has blind spots in its generalizability. Therefore, we propose a novel approach that combines the two into a single framework and attempts to exploit the best features of both. The resulting model, called the W-Net beamformer, includes two components: the first computes time-frequency references, which the second uses to estimate beamforming filters. Results on data covering a wide variety of room and noise conditions, including static and mobile noise sources, show that the proposed beamformer outperforms other methods on all tested evaluation metrics, which indicates that the proposed architecture allows for effective computation of the beamforming filters.
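The two-stage idea in the abstract (a time-frequency reference first, then beamforming filters estimated from it) can be illustrated with a minimal NumPy sketch. This is not the W-Net beamformer: both stages below are hypothetical stand-ins for the DNN components, and the sketch assumes STFT-domain filter-and-sum beamforming with one time-invariant filter per frequency bin. The function names `estimate_reference`, `estimate_filters`, and `apply_beamformer` are invented for illustration.

```python
import numpy as np


def estimate_reference(X):
    """Hypothetical stage 1 (stand-in for a DNN): a time-frequency reference.

    X is a complex multichannel STFT of shape (channels, frames, freqs).
    Here the reference is simply the channel average, shape (frames, freqs).
    """
    return X.mean(axis=0)


def estimate_filters(X, R, eps=1e-6):
    """Hypothetical stage 2 (stand-in for a DNN): filters from the reference.

    For each frequency bin, solve the regularized least-squares problem
    min_w sum_t |w^H x_t - r_t|^2, whose normal equations are
    (sum_t x_t x_t^H + eps I) w = sum_t x_t conj(r_t).
    Returns W of shape (channels, freqs).
    """
    C, T, F = X.shape
    W = np.zeros((C, F), dtype=complex)
    for f in range(F):
        Xf = X[:, :, f]                          # (channels, frames)
        A = Xf @ Xf.conj().T + eps * np.eye(C)   # spatial covariance (unnormalized)
        b = Xf @ R[:, f].conj()                  # cross-correlation with reference
        W[:, f] = np.linalg.solve(A, b)
    return W


def apply_beamformer(W, X):
    """Filter-and-sum in the STFT domain: Y[t, f] = sum_c conj(W[c, f]) * X[c, t, f]."""
    return np.einsum("cf,ctf->tf", W.conj(), X)


# Usage on synthetic data (random complex STFT, 4 channels, 50 frames, 8 bins).
rng = np.random.default_rng(0)
C, T, F = 4, 50, 8
X = rng.standard_normal((C, T, F)) + 1j * rng.standard_normal((C, T, F))

R = estimate_reference(X)      # stage 1: time-frequency reference
W = estimate_filters(X, R)     # stage 2: filters estimated from the reference
Y = apply_beamformer(W, X)     # enhanced single-channel STFT
```

Because the stand-in reference is a linear combination of the channels, the least-squares filters reproduce it almost exactly. In a learned system the reference and the filter estimator would both be produced by trained networks, and the filters might vary over time as well as frequency.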

Comments:	arXiv admin note: substantial text overlap with arXiv:1910.14262
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2005.12683 [eess.AS]
	(or arXiv:2005.12683v2 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2005.12683

Submission history

From: Yuichiro Koyama
[v1] Sat, 23 May 2020 22:30:15 UTC (134 KB)
[v2] Tue, 11 Aug 2020 08:04:39 UTC (134 KB)
