Skip to content

Taking politicians words in political duels under data scrutiny, analyzing what are their most frequent narratives and used words.

Notifications You must be signed in to change notification settings

dhajnes/political_nlp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Berme politikov za slovo banner

PoliticalNLP

Taking politicians words in political duels under data scrutiny, analyzing their most frequent narratives and used words in Slovak language.

Berme politikov za slovo analyzovaním ich vyjadrení v politických dueloch, v slovenskom jazyku.

  • Whisper for speech-to-text
  • ChatGPT for smoothing out phonetically incorrect text
  • Basic NLP tokenization and stemming to acquire most used narratives and words

Case study

TA3, V Politike: Fico vs. Kollár

TA3, V Politike: Fico vs. Kollár

Fig. 1 | Histogram showing most common words in the political duel (skipping high-frequency words such as "ktorý/ktorá" etc.)

Future work

  • classification of different speakers using frequency analysis (Wavelet, Quefrency or simple Fourier transform) so words can be correctly assigned to speakers
  • sentiment analysis
  • most common statement outlineing
  • researching the use of a homeland MLM SlovakBERT
  • streamlining with GPT4 instead of GPT3.5

About

Taking politicians words in political duels under data scrutiny, analyzing what are their most frequent narratives and used words.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages