Deep pagination based only on a start_offset gets less efficient the deeper you go: to fetch 10k docs with a start_offset of 100k, you'd need to find the best 110k results and then drop the first 100k. For that reason, we don't support deep pagination that way.
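To make the cost concrete, here is a toy sketch in Python of what an offset-only page implies. It is not Quickwit's collector; `fetch_page` and the synthetic scores are made up for illustration. The point is only that the number of candidates kept grows with the offset, not with the page size.

```python
import heapq

def fetch_page(scores, start_offset, max_hits):
    """Return one page of (score, doc_id) pairs, best first, using only an offset.

    The collector has to keep start_offset + max_hits candidates,
    even though only max_hits of them end up in the page.
    """
    kept = start_offset + max_hits
    candidates = ((score, doc_id) for doc_id, score in enumerate(scores))
    top = heapq.nlargest(kept, candidates)
    # Everything before start_offset was ranked only to be thrown away.
    return top[start_offset:], kept

# Toy data: 200 docs with distinct scores (hypothetical, for illustration only).
scores = [(i * 37) % 200 for i in range(200)]
page, kept = fetch_page(scores, start_offset=100, max_hits=10)
print(len(page), kept)  # 10 hits returned, 110 candidates ranked
```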
Currently there isn't an alternative in the Quickwit API. If you don't mind using the ES-compatible API instead, it supports both search_after and scroll, which don't suffer from that performance degradation (at least on Quickwit; scrolls are deprecated on ES).
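For reference, a search_after loop could look roughly like the sketch below. It assumes the ES-compatible endpoint accepts the usual `query`, `sort`, `size` and `search_after` fields and returns hits in the Elasticsearch response shape, with a `sort` array on each hit. The helper names, the URL, the index name and the `timestamp` sort field are all placeholders, so check them against your own setup and the Quickwit docs.

```python
import json
import urllib.request

def http_search(url, body):
    """POST a JSON search body and return the decoded JSON response."""
    req = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

def iter_all_hits(url, query, sort, page_size=1000, search=http_search):
    """Yield every hit for `query`, paging with search_after instead of an offset."""
    body = {"query": query, "sort": sort, "size": page_size}
    while True:
        hits = search(url, body)["hits"]["hits"]
        if not hits:
            return
        yield from hits
        # The sort values of the last hit mark where the next page starts.
        body["search_after"] = hits[-1]["sort"]

# Hypothetical usage; the URL, index name and sort field are assumptions:
# url = "http://localhost:7280/api/v1/_elastic/fineweb/_search"
# for hit in iter_all_hits(url, {"match_all": {}}, [{"timestamp": "desc"}]):
#     print(hit)
```

Each request only has to rank `page_size` results past the last one seen, so the cost per page stays flat however deep you go. If the sort values are not unique, hits that tie at a page boundary may be skipped or repeated, so it is safer to sort on something that gives a total order.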
Describe the bug
A clear and concise description of what the bug is.
Steps to reproduce (if applicable)
Steps to reproduce the behavior:
I am trying to search through fineweb and collect all the matches. However, that doesn't seem to be possible because start_offset is capped at 10k.
Expected behavior
A clear and concise description of what you expected to happen.
I want an easy way to collect all the matches. Even better, I just want their ids.
Configuration:
Please provide:
quickwit --version