Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to specify the timeline in which we want to extract the posts? #100

Open
Nitigya-Handa opened this issue Aug 13, 2020 · 5 comments
Open

Comments

@Nitigya-Handa
Copy link

Is there a way I can specify dates both from and upto which I want to extract the posts from?

@kevinzg
Copy link
Owner

kevinzg commented Aug 17, 2020

No, the scraper just open the FB page at the latest posts and then "scrolls down" to get more posts.
I don't know if there is a way to request only posts from a certain range.

@Nitigya-Handa
Copy link
Author

Nitigya-Handa commented Aug 18, 2020

There must be a way because I had used a Twitter Scraper GetOldTweets3 which has a library known as TweetCriteria using which one is able to set the timeline and define keywords or hashtags to search/extract the tweets with.

@widedM
Copy link

widedM commented Sep 11, 2020

You can create a list of keywords and check if the post have minimum one word of that list. It solve the problem

@GivenToFlyCoder
Copy link

GivenToFlyCoder commented Nov 1, 2020

Use date time variable "time" as "from date limit", implement it in a while loop that breaks getting post if the date publication is less than you desired.

Upto date limit i don´t think it can be possible, because it scrolls down from actual date extraction, but you can store it and then filter it.

@kevinzg
Copy link
Owner

kevinzg commented Nov 7, 2020

Note that some pages have sticky posts, so it's a good idea to skip the check for the first posts of the page.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants