akbartus/WebVR-Captioning

Image Captioning in Web VR

Screenshot

Description / Rationale

This is a small project that demonstrates image captioning (a machine learning task; model used: https://huggingface.co/nlpconnect/vit-gpt2-image-captioning) in Web VR. It was inspired by a similar project by Misslivirose, Scene Reader, which demonstrates image captioning with Three.js and the Microsoft Azure service.

Instructions

To see image captioning at work, click the camera icon. Each click captures an image of the scene and generates a caption for it. To see the magic happen, try to find the answer to the riddle.

Tools Used

The project uses A-Frame at its core, together with the Hugging Face API.
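
As a rough illustration of how a captured scene image might be sent to the captioning model, here is a minimal TypeScript sketch. It is not taken from this repository. The endpoint pattern in `API_BASE`, the bearer-token header, and the `[{ generated_text: ... }]` response shape are assumptions based on the hosted Hugging Face Inference API for image-to-text models and may differ from what the project actually uses. The helper names `buildCaptionRequest` and `extractCaption` are hypothetical.

```typescript
// Model named in this README.
const MODEL_ID = "nlpconnect/vit-gpt2-image-captioning";
// Assumption: URL pattern of the hosted Hugging Face Inference API.
const API_BASE = "https://api-inference.huggingface.co/models/";

interface CaptionRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: Uint8Array;
}

// Hypothetical helper: describe the HTTP request for one captured image.
// `imageBytes` is the raw PNG/JPEG data of the scene capture;
// `token` is a Hugging Face access token supplied by the caller.
function buildCaptionRequest(imageBytes: Uint8Array, token: string): CaptionRequest {
  return {
    url: API_BASE + MODEL_ID,
    method: "POST",
    headers: {
      Authorization: "Bearer " + token,
      "Content-Type": "application/octet-stream",
    },
    body: imageBytes,
  };
}

// Hypothetical helper: pull the caption out of the parsed JSON response.
// Assumes a successful response looks like [{ "generated_text": "..." }];
// anything else (for example an error object) yields null.
function extractCaption(payload: unknown): string | null {
  if (!Array.isArray(payload) || payload.length === 0) return null;
  const first: unknown = payload[0];
  if (typeof first !== "object" || first === null) return null;
  const text = (first as { generated_text?: unknown }).generated_text;
  return typeof text === "string" ? text.trim() : null;
}
```

In a browser, the object returned by `buildCaptionRequest` could be passed to `fetch(req.url, req)`, and the parsed JSON body handed to `extractCaption`. Keeping the request description and the response parsing as pure functions makes both easy to check without a network connection.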

Credits

The 3D model of the room was created by Francesco Coldesina and taken from Sketchfab.com.

Demo

To see the application at work: Demo application