Fusion: Practice and Applications
  FPA
  2692-4048
  2770-0070
  
   10.54216/FPA
   https://www.americaspg.com/journals/show/2620
  
 
 
  
   2018
  
  
   2018
  
 
 
  
   Role of Context in Visual Language Models for Object Recognition and Detection in Irregular Scene Images
  
  
   GITAM (Deemed to be University), Visakhapatnam, A.P., India; PVP Siddhartha Institute of Technology, Vijayawada, A.P., India
   
    A.
    A.
   
    GITAM (Deemed to be University), Visakhapatnam, A.P., India
   
    T.
    Umadevi
   
  
  
   In this work, we rethink the phonetic growing experience in scene message recognition and abandon the broadly acknowledged complex language model. We present a Visual Language Displaying Organization (Vision LAN), which considers the visual and etymological data as an association by straightforwardly enriching the vision model with language capacities, rather than prior strategies that look at the visual and semantic data in two free designs. Specifically, we present person shrewd impeded highlight map message recognition in the preparation stage. At the point when visual prompts (like impediment, commotion, and so on) are perplexed, this activity guides the vision model to utilize both the visual surface of the characters and the phonetic data in the visual setting for recognition. To improve the performance of visual language models devoted to item identification and recognition in irregular scene images, the abstract investigates the critical function that context plays. Distinguished by intricate and ever-changing visual components, irregular sceneries pose distinct difficulties for conventional computer vision systems.
  
  
   2024
  
  
   2024
  
  
   250
   261
  
  
   10.54216/FPA.150120
   https://www.americaspg.com/articleinfo/3/show/2620