A Review of Deep Learning–Based Sarcasm Detection in Monolingual Speech Using Sentiment Analysis

Authors

  • Agrawal Nikita Manohar,Dr. Manav Thakur

Keywords:

Sarcasm Detection, Deep Learning, Monolingual Speech, Sentiment Analysis, Affective Computing

Abstract

Sarcasm detection in spoken language has emerged as a challenging problem in affective computing and speech analytics due to its reliance on implicit meaning, contextual cues, and paralinguistic features. In monolingual speech settings, sarcasm is often conveyed through subtle variations in prosody, tone, pitch, and rhythm rather than explicit lexical markers, making traditional rule-based or shallow machine learning approaches insufficient. Recent advances in deep learning have significantly improved sarcasm detection by enabling automatic feature learning from raw audio signals and speech-derived representations. This review critically examines deep learning–based approaches for sarcasm detection in monolingual speech with a specific focus on sentiment analysis–driven frameworks. It discusses commonly used architectures such as convolutional neural networks, recurrent neural networks, long short-term memory networks, and transformer-based models, highlighting how they integrate acoustic, prosodic, and sentiment-oriented features. The paper also reviews widely used speech corpora, evaluation metrics, and preprocessing techniques relevant to monolingual sarcastic speech analysis. Furthermore, key challenges—including data sparsity, speaker dependency, contextual ambiguity, and class imbalance—are identified, along with emerging trends such as multimodal fusion and self-supervised learning. This review provides a structured synthesis of current research and outlines future directions for building more robust and context-aware sarcasm detection systems in monolingual spoken discourse.

References

Abdullah, T., & Ahmet, A. (2022). Deep learning in sentiment analysis: Recent architectures. ACM Computing Surveys, 55(8), 1-37.

Abhinav, A., & Kumar, P. (2021). Detection of sarcasm through tone analysis using CNN-LSTM on audio features. International Journal of Speech Technology, 24(3), 715–728.

Acheampong, F. A., Nunoo-Mensah, H., & Chen, W. (2021). Transformer models for text-based emotion detection: a review of BERT-based approaches. Artificial Intelligence Review, 54(8), 5789-5829.

Afiyati, A., Azhari, A., Sari, A. K., & Karim, A. (2020). Challenges of sarcasm detection for social network: a literature review. JUITA: Jurnal Informatika, 169-178.

Arora, S., & Gupta, R. (2021). A deep learning pipeline for monolingual sarcasm detection from spontaneous speech corpora. Speech Communication, 132, 56–68.

Arslan, S., & Chen, Y. (2018). Sentiment polarity reversal in sarcastic speech: Challenges for sentiment analysis systems. Information Retrieval Journal, 21(3), 231–252.

Bedi, H., & Rao, M. (2020). VGGish feature extraction for paralinguistic classification tasks: Applications to sarcasm detection. ICASSP 2020 Workshop Proceedings.

Bharti, S. K., Gupta, R. K., Shukla, P. K., Hatamleh, W. A., Tarazi, H., & Nuagah, S. J. (2022). Multimodal sarcasm detection: a deep learning approach. Wireless Communications and Mobile Computing, 2022(1), 1653696.

Cai, Y., Cai, H., & Wan, X. (2019). Multi-modal sarcasm detection in Twitter with hierarchical fusion model. In Proceedings of ACL (short/Findings / workshop papers as applicable).

Calvo, R. A., & D'Mello, S. (2019). Sentiment and emotion recognition in speech: From features to deep models. IEEE Transactions on Affective Computing, 10(2), 173–187.

Castro, F., & Poria, S. (2020). Emotion and sentiment labels for multimodal sarcasm corpora: annotator agreement and baselines. Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020).

Chauhan, D. S., Singh, G. V., Arora, A., Ekbal, A., & Bhattacharyya, P. (2022, October). A sentiment and emotion aware multimodal multiparty humor recognition in multilingual conversational setting. In Proceedings of the 29th international conference on computational linguistics (pp. 6752-6761).

Downloads

How to Cite

Agrawal Nikita Manohar,Dr. Manav Thakur. (2024). A Review of Deep Learning–Based Sarcasm Detection in Monolingual Speech Using Sentiment Analysis. International Journal of Engineering Science & Humanities, 14(4), 302–313. Retrieved from https://www.ijesh.com/j/article/view/566

Similar Articles

<< < 15 16 17 18 19 20 21 22 > >> 

You may also start an advanced similarity search for this article.