Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FWS-1372 Fix os_eventq_put() to wake up task only if EVQ_WAIT flag is set #3117

Closed
wants to merge 1 commit into from

Conversation

vrahane
Copy link
Contributor

@vrahane vrahane commented Dec 7, 2023

  • This was causing a task waiting on a semaphore to wakeup on scheduling an event

@vrahane vrahane changed the title FWS-1372 Fix os_eventq_put() to wake up task only if EVQ_WAIT flag is FWS-1372 Fix os_eventq_put() to wake up task only if EVQ_WAIT flag is set Dec 7, 2023
Copy link

@wes3 wes3 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Well this is quite the good catch I must say. Unless I am mistaken this can cause a number of issues that would be hard to find/debug. Again, unless I am mistaken, if code uses os_time_delay(), a task can get woken up before the delay expires if an event is posted. I cannot see anything wrong with this fix.

Comment on lines +72 to +73
if (evq->evq_task->t_state == OS_TASK_SLEEP &&
evq->evq_task->t_flags & OS_TASK_FLAG_EVQ_WAIT) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While this does not break anything I don't think it solves the problem.
evq->evq_task is only set when task calls os_eventq_get() this is blocking call so it's strange that same task can later call os_sem_pend().
For me it more likely looks like memory corruption of task struct.

Copy link
Contributor Author

@vrahane vrahane Dec 7, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think I did not do a good job at explaining this. So, what is happening is that we have the default task waiting on a semaphore. The task actually wakes up when os_eventq_put() is called which should not happen, it should only wakeup when os_sem_release() is called on the pending semaphore. So, what this change does is it forces the scheduler to be blocked on the semaphore.
The above code did the right thing back in 2015-2016 before we added os_eventq_poll(). You will have to go back into incubator-mynewt-core to look at the exact code. The functionality broke when OS_TASK_SLEEP flag was added I believe.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So, what this change does is it forces the scheduler to be blocked on the semaphore even though an event is scheduled if thats what the task has been waiting for.

This is exactly what's impossible here: task cannot be blocked on semaphore and wait for an event at the same time.

The condition you modify is only executed if evq->evq_task != NULL and this is only valid if task is blocked on os_eventq_get. If you have task blocked on a semaphore which is also set as evq_task in some eventq then something is broken elsewhere.

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I read the code in more detail (which I should have done before approving admittedly) and yes you are correct. Checking evq_task first will guarantee the above situation does not occur. This is good :-) Thanks guys

Copy link
Contributor Author

@vrahane vrahane Dec 8, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you, makes complete sense. We got a bit carried away by the semaphore behavior which contributed to the issue.

set

- This was causing a task waiting on a semaphore to wakeup on scheduling
an event
@vrahane vrahane force-pushed the vipul/os_eventq_put_fix branch from f0ad606 to 7f6187b Compare December 7, 2023 23:22
@vrahane
Copy link
Contributor Author

vrahane commented Dec 7, 2023

Closing this, I tried reproducing it with semaphore tokens initialized to 0 before pending which was causing our issue.

@vrahane vrahane closed this Dec 7, 2023
@vrahane vrahane deleted the vipul/os_eventq_put_fix branch December 7, 2023 23:59
@vrahane vrahane requested a review from wes3 December 8, 2023 14:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants