Adding support for automatic summarization #17
Comments
An LLM could likely help with this.
We should probably implement a basic interface that lets people choose how to obtain the summary, like we do for converting to plain text. That way, users can choose to use an LLM. Just like the original library, this one was designed for people who want a light, privacy-oriented way to get an article free of clutter, so I do not think an LLM would be a good fit for integration in the library itself. To be fair, I never found a good way to do this algorithmically, which is why we should give users a simple way to do what they want.
A fair concern, and I totally agree. An extraction-based algorithm could be added that ranks the sentences and keeps the most important ones, but quality would probably vary a lot and complexity could spiral out of control, especially with i18n in mind. Maybe some text-analysis visualization tools could help readers skim the text more quickly instead of creating summaries (even a simple highlight of the longest sentences, or of field-specific terms based on frequency scores). Abstractive summarization, with privacy in mind, could be implemented in a few years if local LLMs become available.
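For illustration only, the extraction-based idea above could look roughly like the sketch below. It is not part of SmartReader: the function name `summarize`, the length-based stop-word filter, and the regex sentence splitter are all assumptions, and the splitter is English-centric, which is exactly the i18n problem mentioned above.

```python
import re
from collections import Counter


def summarize(text: str, max_sentences: int = 2) -> str:
    """Hypothetical frequency-based extractive summary (illustrative only)."""
    # Naive sentence split on '.', '!' or '?' followed by whitespace.
    # This assumes English-like punctuation and will fail for many languages.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

    # Count word frequencies, ignoring short words as a crude stop-word filter.
    words = re.findall(r"\w+", text.lower())
    freq = Counter(w for w in words if len(w) > 3)

    def score(sentence: str) -> float:
        tokens = re.findall(r"\w+", sentence.lower())
        if not tokens:
            return 0.0
        # Average frequency per token, so long sentences are not favored
        # simply for being long. Short words score 0 because they were
        # never counted.
        return sum(freq[t] for t in tokens) / len(tokens)

    # Rank sentence indices by score, keep the best ones,
    # then restore the original order so the summary reads naturally.
    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in keep)
```

Even on this small scale the weak points are visible: the result depends heavily on the tokenizer and the stop-word heuristic, both of which would need per-language handling.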
I use SmartReader in combination with If you want to support different languages, I would suggest using an LLM instead of writing an algorithm yourself.
This is probably the best solution. Since I never found a good way to do this algorithmically, it is better to just provide a hook that lets users do what they want.
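As a rough sketch of what such a hook could look like (written in Python for brevity; this is not SmartReader's actual API, and `Article`, `text_content`, `excerpt`, and `summarizer` are hypothetical names), the library would only accept a user-supplied function and fall back to the manual excerpt when none is given:

```python
from dataclasses import dataclass
from typing import Callable, Optional

# A summarizer is any function that maps the article text to a summary.
# Users could plug in an LLM call, a local model, or a simple heuristic.
Summarizer = Callable[[str], str]


@dataclass
class Article:
    """Hypothetical stand-in for an extracted article."""
    text_content: str
    excerpt: Optional[str] = None            # manual excerpt found in the page, if any
    summarizer: Optional[Summarizer] = None  # user-supplied hook, if any

    def summary(self) -> Optional[str]:
        # Prefer the user's hook; otherwise return the manual excerpt.
        if self.summarizer is not None:
            return self.summarizer(self.text_content)
        return self.excerpt
```

With this shape, the library itself stays light and makes no network calls; whether the summary comes from an LLM is entirely the caller's decision.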
The library can extract any manual excerpt contained in the article (i.e., the short summary that is usually shown on Facebook or Twitter). However, it would also be useful to generate an automatic summary for long articles.
The issue is that there does not seem to be anything that is both really effective and light on resources for this, so the quality of the result may vary.