Conference Papers
Conference papers from the five most recent years are listed below. (*: corresponding author)
• Shin, W., Park, H. J., Kim, J. S., Juan, Y., Park, S. H., & Han, S. W. (2026). PrevMatch: Revisiting and Maximizing Temporal Knowledge in Semi-Supervised Semantic Segmentation. Winter Conference on Applications of Computer Vision (WACV) (accepted).
• Park, H. J., Liu, J., Kim, J. S., Yang, J. Y., Han, S. W., & Song, E. (2025). RapFlow-TTS: Rapid and High-Fidelity Text-to-Speech with Improved Consistency Flow Matching. INTERSPEECH.
• Shin, W.**, Park, H.J.**, Kim, J.S.**, Kim, D., Lee, S. and Han, S.W.* (2023). Rethinking Transfer and Auxiliary Learning for Improving Audio Captioning Transformer. INTERSPEECH.
• Shin, W., Lee, B.H., Kim, J.S., Park, H.J., and Han, S.W.* (2023). MetricGAN-OKD: Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement. International Conference on Machine Learning (ICML).
• Kim, J.S., Park, H.J., Shin, W., and Han, S.W.* (2023). AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
• Park, H.J.**, Yang, S.W.**, Kim, J.S., Shin, W., and Han, S.W.* (2023). TriAAN-VC: Triple Adaptive Attention Normalization for any-to-any Voice Conversion. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
• Lee, M.S.**, Yang, S.W.**, and Han, S.W.* (2023). GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic Segmentation. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
• Shin, W.**, Park, H.J.**, Kim, J.S., Lee, B.H., and Han, S.W.* (2022). Multi-View Attention Transfer for Efficient Speech Enhancement. INTERSPEECH.
• Kim, J.S.**, Park, H.J.**, Shin, W.**, and Han, S.W.* (2022). A Robust Framework for Sound Event Localization and Detection on Real Recordings. Technical report (3rd prize) for Sound Event Localization and Detection. Detection and Classification of Acoustic Scenes and Events (DCASE).
• Lee, M.S.**, Shin, W.**, and Han, S.W.* (2022). TRACER: Extreme Attention guided Salient Object Tracing Network. Proceedings of the AAAI Conference on Artificial Intelligence.
• Park, H.J., Kang, B.H., Shin, W., Kim, J.S. and Han, S.W.* (2022). MANNER: Multi-View Attention Network for Noise Erasure. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).