An Approach for Devising Stenography Application Using Cross Modal Attention

 

Shanthalakshmi M *,Susmita Mishra , LincyJemina S , Raashmi P, Mannuru Shalin,jananeee.v

 

Rajalakshmi Engineering College, Panimalar Institute of Technology,India

Emails: shanthalakshmi.m@rajalakshmi.edu.in; susmitamishra12@gmail.com; lincypit@gmail.com; raashmi.p.2018.cse@rajalakshmi.edu.in; mannuru.shalini.2018.cse@rajalakshmi.edu.in; jananee.v@rajalakshmi.edu.in

 

Abstract

This paper focuses on providing a solution to the direct conversion of speech to shorthand. Since shorthand is not understood by many but is used for writing quick transcripts, a product is developed that converts the speech to its appropriate Gregg shorthand. A website that will be used as a front end, will use a speech-to-text API to record the speech in real-time. The converted text will then be fed into a text-to-image retrieval model that derives its corresponding Gregg shorthand for the text. The text will then be displayed to the user in real-time. By achieving this, the model reduces the need to depend upon stenographers for transcribing scripts. The resulting model achieves a good result.

  Keywords: Devising Stenography; Cross Modal Attention; speech shorthand; speech conversion