Skip to content

Latest commit

 

History

History
39 lines (20 loc) · 1.84 KB

README.md

File metadata and controls

39 lines (20 loc) · 1.84 KB

ConvoCloud

ConvoCloud is an app that provides a real-time visualisation of the semantic content of spoken language. It achieves this by integrating the technologies of Automatic Speech Recognition, Natural Language Processing and Word Cloud generation.

Compatability & Demo

This first version currently runs in Google Chrome Desktop.

Use Cases

With fixes and additions it could possibly be used by those with hearing impairments as an assistive tool for capturing the topics of a spoken conversation. Other possible use cases include lectures, meetings and other scenarios where a semantic summary of what is being said would be useful.

Screenshot

ConvoCloud Example

Importance Algorithm

The importance, i.e. size, of a word in the cloud is currently determined by a) frequency b) syntactic word class c) character length. However, this is still being adjusted.

Tech/FrameWorks Used

Web

This is a Flask web app that utlises Flask-Bootstrap. RequireJS is responsible for serving the JavaScript modules. To send speech input to the web server for processing the app uses SocketIO.

Automatic Speech Recognition

To capture speech input for the cloud the Web Speech API is used.

Natural Language Processing

The python scripts that process the raw speech input and create semantically useful tokens for visualisation make use of the Natural Language Toolkit. This includes: tokenisation, removal of stop words and swear words and lemmatisation.

Word Cloud Generation

The bright and beautiful word clouds are rendered using the JQcloud library.