Skip to content

RuthKirby/convocloud

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ConvoCloud

ConvoCloud is an app that provides a real-time visualisation of the semantic content of spoken language. It achieves this by integrating the technologies of Automatic Speech Recognition, Natural Language Processing and Word Cloud generation.

Compatability & Demo

This first version currently runs in Google Chrome Desktop.

Use Cases

With fixes and additions it could possibly be used by those with hearing impairments as an assistive tool for capturing the topics of a spoken conversation. Other possible use cases include lectures, meetings and other scenarios where a semantic summary of what is being said would be useful.

Screenshot

ConvoCloud Example

Importance Algorithm

The importance, i.e. size, of a word in the cloud is currently determined by a) frequency b) syntactic word class c) character length. However, this is still being adjusted.

Tech/FrameWorks Used

Web

This is a Flask web app that utlises Flask-Bootstrap. RequireJS is responsible for serving the JavaScript modules. To send speech input to the web server for processing the app uses SocketIO.

Automatic Speech Recognition

To capture speech input for the cloud the Web Speech API is used.

Natural Language Processing

The python scripts that process the raw speech input and create semantically useful tokens for visualisation make use of the Natural Language Toolkit. This includes: tokenisation, removal of stop words and swear words and lemmatisation.

Word Cloud Generation

The bright and beautiful word clouds are rendered using the JQcloud library.

About

Real-time NLP app

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published