
Optimization of work done by the Crawler/Dashboard #84

Open
derekpierre opened this issue Oct 30, 2020 · 1 comment

Comments

@derekpierre
Member

derekpierre commented Oct 30, 2020

Functionality like this, which projects stakes and stakers, can be problematic when the network is as large as it is:

    @collector(label="Projected Stake and Stakers")
    def _measure_future_locked_tokens(self, periods: int = 365):
        period_range = range(1, periods + 1)
        token_counter = dict()
        for day in period_range:
            # One full paginated on-chain scan of all active stakers, repeated
            # once per projected day
            tokens, stakers = self.staking_agent.get_all_active_stakers(periods=day, pagination_size=200)
            token_counter[day] = (float(NU.from_nunits(tokens).to_tokens()), len(stakers))
        return dict(token_counter)

The code projects the network's stakes and stakers over the next year: for each of the next 365 days, it fetches information on all of the active stakes. When the network is large this is extremely expensive computationally.
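For scale: assuming staker count roughly tracks the ~1800-node count below, and given pagination_size=200, each day's get_all_active_stakers call pages through ⌈1800 / 200⌉ = 9 contract reads, so a single 365-day projection issues roughly 365 × 9 ≈ 3,300 paginated reads every crawl round.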

Two questions arise:

  1. Do we need to show this functionality at all? (It is commented out in the https://github.com/derekpierre/nucypher-monitor/tree/overloaded branch.)

Simply commenting it out improved our data collection round time to about 1 minute (down from sometimes 15 minutes 😱) and improved stability; the node count was about 1800 at the time.

crawler_1   | Scraping Round #613 ========================
crawler_1   | ✓ ... Current Period
crawler_1   | ✓ ... Date/Time of Next Period [0s]
crawler_1   | ✓ ... Latest Teacher [0s]
crawler_1   | ✓ ... Previous Fleet States [0s]
crawler_1   | ✓ ... Network Event Details [0s]
crawler_1   | ✓ ... Known Node Details [0s]
crawler_1   | ✓ ... Known Nodes [31.0s]
crawler_1   | ✓ ... Staker Confirmation Status [31.0s]
crawler_1   | ✓ ... Global Network Locked Tokens
crawler_1   | ✓ ... Top Stakes [0s]
crawler_1   | Scraping round completed (duration 0:01:02).
  2. If we do want to keep that functionality, how can it be optimized? (One possibility is sketched below.)
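If we keep it, one direction is to scan the stakes once and derive the 365-day projection locally, instead of issuing one paginated on-chain scan per projected day. This is a sketch only: get_all_stakes() and its (staker_address, locked_nunits, remaining_periods) return shape are hypothetical, not the actual StakingEscrowAgent API.

    from collections import defaultdict

    from nucypher.blockchain.eth.token import NU  # same NU as in the snippet above

    def _measure_future_locked_tokens_single_pass(self, periods: int = 365):
        # Sketch: one pass over a hypothetical get_all_stakes() iterator of
        # (staker_address, locked_nunits, remaining_periods) tuples; the real
        # agent API may differ.
        locked_per_day = defaultdict(int)
        stakers_per_day = defaultdict(set)
        for staker, nunits, remaining in self.staking_agent.get_all_stakes():
            # A stake with `remaining` periods left counts toward every
            # projection day up to (and including) that horizon.
            for day in range(1, min(remaining, periods) + 1):
                locked_per_day[day] += nunits
                stakers_per_day[day].add(staker)
        return {day: (float(NU.from_nunits(locked_per_day[day]).to_tokens()),
                      len(stakers_per_day[day]))
                for day in range(1, periods + 1)}

This trades the ~3,300 RPC pages per round for a single scan plus O(stakes × periods) of local work.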
@derekpierre derekpierre changed the title Optimization work for the Crawler/Dashboard functionality Optimization work done by the Crawler/Dashboard Oct 30, 2020
@derekpierre derekpierre changed the title Optimization work done by the Crawler/Dashboard Optimization of work done by the Crawler/Dashboard Oct 30, 2020
@cygnusv
Member

cygnusv commented Oct 30, 2020

I think we don't need this functionality, or more accurately, we don't need it at the same frequency as the main crawler loop (which is mostly discovery-loop work). For this specific case, a secondary loop that runs daily would be more than enough.
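A hedged sketch of that split, assuming the crawler can schedule work with Twisted's LoopingCall; the names start_projection_loop, _collect_projection, and _projection_stats are placeholders, not existing crawler attributes:

    from twisted.internet import task

    ONE_DAY = 60 * 60 * 24  # seconds

    def start_projection_loop(self):
        # Run the expensive projection on its own daily schedule so the main
        # scraping round (mostly discovery-loop work) stays fast.
        self._projection_task = task.LoopingCall(self._collect_projection)
        self._projection_task.start(ONE_DAY, now=True)

    def _collect_projection(self):
        # Cache the result; the dashboard serves the cached value between runs.
        self._projection_stats = self._measure_future_locked_tokens(periods=365)

The dashboard would then read _projection_stats each round instead of recomputing the projection.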
