Calculated based on number of publications stored in Pure and citations from Scopus
20152024

Research activity per year

Filter
Conference contribution

Search results

  • 2024

    Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

    Yariv, G., Gat, I., Benaim, S., Wolf, L., Schwartz, I. & Adi, Y., 25 Mar 2024, Proceedings of the 38th AAAI Conference on Artificial Intelligence: AAAI-24 Technical Tracks 7. Wooldridge, M., Dy, J. & Natarajan, S. (eds.). 7 ed. Association for the Advancement of Artificial Intelligence, Vol. 38. p. 6639-6647 9 p. (Proceedings of the AAAI Conference on Artificial Intelligence; vol. 38, no. 7).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    3 Scopus citations
  • 2023

    AERO: Audio Super Resolution in the Spectral Domain

    Mandel, M., Tal, O. & Adi, Y., 2023, ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1-5 5 p. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2023-June).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    13 Scopus citations
  • Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling

    Gat, I., Kreuk, F., Nguyen, T. A., Lee, A., Copet, J., Synnaeve, G., Dupoux, E. & Adi, Y., 2023, 20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference. Salesky, E., Federico, M. & Carpuat, M. (eds.). Association for Computational Linguistics, p. 465-477 13 p. (20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    3 Scopus citations
  • Generative Spoken Language Model based on continuous word-sized audio tokens

    Algayres, R., Adi, Y., Nguyen, T. A., Copet, J., Synnaeve, G., Sagot, B. & Dupoux, E., 2023, EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings. Bouamor, H., Pino, J. & Bali, K. (eds.). Association for Computational Linguistics (ACL), p. 3008-3023 16 p. (EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    1 Scopus citations
  • I Hear Your True Colors: Image Guided Audio Generation

    Sheffer, R. & Adi, Y., 2023, ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1-5 5 p. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2023-June).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    19 Scopus citations
  • Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units

    Maimon, G. & Adi, Y., 2023, Findings of the Association for Computational Linguistics: EMNLP 2023. Association for Computational Linguistics (ACL), p. 8048-8061 14 p. (Findings of the Association for Computational Linguistics: EMNLP 2023).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
  • Stop: A Dataset for Spoken Task Oriented Semantic Parsing

    Tomasello, P., Shrivastava, A., Lazar, D., Hsu, P. C., Le, D., Sagar, A., Elkahky, A., Copet, J., Hsu, W. N., Adi, Y., Algayres, R., Nguyen, T. A., Dupoux, E., Zettlemoyer, L. & Mohamed, A., 2023, 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 991-998 8 p. (2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    12 Scopus citations
  • 2022

    CONTINUAL SELF-TRAINING WITH BOOTSTRAPPED REMIXING FOR SPEECH ENHANCEMENT

    Tzinis, E., Adi, Y., Ithapu, V. K., Xu, B. & Kumar, A., 2022, 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 6947-6951 5 p. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2022-May).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    9 Scopus citations
  • Direct Speech-to-Speech Translation With Discrete Units

    Lee, A., Chen, P. J., Wang, C., Gu, J., Popuri, S., Ma, X., Polyak, A., Adi, Y., He, Q., Tang, Y., Pino, J. & Hsu, W. N., 2022, ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers). Muresan, S., Nakov, P. & Villavicencio, A. (eds.). Association for Computational Linguistics (ACL), p. 3327-3339 13 p. (Proceedings of the Annual Meeting of the Association for Computational Linguistics; vol. 1).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    64 Scopus citations
  • On the Importance of Gradient Norm in PAC-Bayesian Bounds

    Gat, I., Adi, Y., Schwing, A. & Hazan, T., 2022, Advances in Neural Information Processing Systems 35 - 36th Conference on Neural Information Processing Systems, NeurIPS 2022. Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K. & Oh, A. (eds.). Neural information processing systems foundation, (Advances in Neural Information Processing Systems; vol. 35).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    1 Scopus citations
  • Text-Free Prosody-Aware Generative Spoken Language Modeling

    Kharitonov, E., Lee, A., Polyak, A., Adi, Y., Copet, J., Lakhotia, K., Nguyen, T. A., Rivière, M., Mohamed, A., Dupoux, E. & Hsu, W. N., 2022, ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers). Muresan, S., Nakov, P. & Villavicencio, A. (eds.). Association for Computational Linguistics (ACL), p. 8666-8681 16 p. (Proceedings of the Annual Meeting of the Association for Computational Linguistics; vol. 1).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    32 Scopus citations
  • textless-lib: a Library for Textless Spoken Language Processing

    Kharitonov, E., Copet, J., Lakhotia, K., Nguyen, T. A., Tomasello, P., Lee, A., Elkahky, A., Hsu, W. N., Mohamed, A., Dupoux, E. & Adi, Y., 2022, NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Demonstrations Session. Association for Computational Linguistics (ACL), p. 1-9 9 p. (NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Demonstrations Session).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    8 Scopus citations
  • Textless Speech-to-Speech Translation on Real Data

    Lee, A., Gong, H., Duquenne, P. A., Schwenk, H., Chen, P. J., Wang, C., Popuri, S., Adi, Y., Pino, J., Gu, J. & Hsu, W. N., 2022, NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference. Association for Computational Linguistics (ACL), p. 860-872 13 p. (NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    46 Scopus citations
  • 2021

    Fairness in the Eyes of the Data: Certifying Machine-Learning Models

    Segal, S., Adi, Y., Pinkas, B., Baum, C., Ganesh, C. & Keshet, J., 21 Jul 2021, AIES 2021 - Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society. Association for Computing Machinery, Inc, p. 926-935 10 p. (AIES 2021 - Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    20 Scopus citations
  • FAIRSEQ S2: A Scalable and Integrable Speech Synthesis Toolkit

    Wang, C., Hsu, W. N., Adi, Y., Polyak, A., Lee, A., Chen, P. J., Gu, J. & Pino, J., 2021, EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. Association for Computational Linguistics (ACL), p. 143-152 10 p. (EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    13 Scopus citations
  • Speech resynthesis from discrete disentangled self-supervised representations

    Polyak, A., Adi, Y., Copet, J., Kharitonov, E., Lakhotia, K., Hsu, W. N., Mohamed, A. & Dupoux, E., 2021, 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. International Speech Communication Association, p. 3531-3535 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH; vol. 5).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    86 Scopus citations
  • 2020

    Hide and Speak: Towards deep neural networks for speech steganography

    Kreuk, F., Adi, Y., Raj, B., Singh, R. & Keshet, J., 2020, Interspeech 2020. International Speech Communication Association, p. 4656-4660 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH; vol. 2020-October).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    10 Scopus citations
  • Phoneme Boundary Detection Using Learnable Segmental Features

    Kreuk, F., Sheena, Y., Keshet, J. & Adi, Y., May 2020, 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 8089-8093 5 p. 9053053. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2020-May).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    21 Scopus citations
  • Real time speech enhancement in the waveform domain

    Défossez, A., Synnaeve, G. & Adi, Y., 2020, Interspeech 2020. International Speech Communication Association, p. 3291-3295 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH; vol. 2020-October).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    242 Scopus citations
  • Self-supervised contrastive learning for unsupervised phoneme segmentation

    Kreuk, F., Keshet, J. & Adi, Y., 2020, Interspeech 2020. International Speech Communication Association, p. 3700-3704 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH; vol. 2020-October).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    45 Scopus citations
  • Unsupervised cross-domain singing voice conversion

    Polyak, A., Wolf, L., Adi, Y. & Taigman, Y., 2020, Interspeech 2020. International Speech Communication Association, p. 801-805 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH; vol. 2020-October).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    25 Scopus citations
  • Voice separation with an unknown number of multiple speakers

    Nachmani, E., Adi, Y. & Wolf, L., 2020, 37th International Conference on Machine Learning, ICML 2020. Daume, H. & Singh, A. (eds.). International Machine Learning Society (IMLS), p. 7121-7132 12 p. (37th International Conference on Machine Learning, ICML 2020; vol. PartF168147-10).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    73 Scopus citations
  • 2019

    To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition

    Adi, Y., Zeghidour, N., Collobert, R., Usunier, N., Liptchinsky, V. & Synnaeve, G., May 2019, 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 3742-3746 5 p. 8682468. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2019-May).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    24 Scopus citations
  • 2018

    Fooling end-to-end speaker verification with adversarial examples

    Kreuk, F., Adi, Y., Cisse, M. & Keshet, J., 10 Sep 2018, 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1962-1966 5 p. 8462693. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 2018-April).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    155 Scopus citations
  • Turning your weakness into a strength: Watermarking deep neural networks by backdooring

    Adi, Y., Baum, C., Cisse, M., Pinkas, B. & Keshet, J., 2018, Proceedings of the 27th USENIX Security Symposium. USENIX Association, p. 1615-1631 17 p. (Proceedings of the 27th USENIX Security Symposium).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    498 Scopus citations
  • 2017

    Sequence segmentation using joint RNN and structured prediction models

    Adi, Y., Keshet, J., Cibelli, E. & Goldrick, M., 16 Jun 2017, 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 2422-2426 5 p. 7952591. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    19 Scopus citations
  • 2015

    Vowel duration measurement using deep neural networks

    Adi, Y., Keshet, J. & Goldrick, M., 10 Nov 2015, 2015 IEEE International Workshop on Machine Learning for Signal Processing - Proceedings of MLSP 2015. Erdogmus, D., Kozat, S., Larsen, J. & Akcakaya, M. (eds.). IEEE Computer Society, 7324331. (IEEE International Workshop on Machine Learning for Signal Processing, MLSP; vol. 2015-November).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    10 Scopus citations
Your message has successfully been sent.
Your message was not sent due to an error.