
Add FAQs and Common Issues doc page #7547

Open
wants to merge 1 commit into base: main

Conversation

@GregoryComer (Member) commented on Jan 8, 2025

Summary

Add "FAQs and Common Issues" page to the ExecuTorch docs. This summarizes common issues that we've seen when users adopt ExecuTorch.

I've tentatively put this under the Getting Started section, as that seems like the most reasonable place to put it, but I'm open to suggestions.

Test plan

New doc page preview: https://docs-preview.pytorch.org/pytorch/executorch/7547/getting-started-faqs.html


pytorch-bot bot commented Jan 8, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7547

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit e8e0865 with merge base 6c9b9b6:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label on Jan 8, 2025. (This label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed.)

We are actively working to improve the out-of-the-box behavior, but the above APIs can be used to improve mobile performance as a workaround until deeper changes for performant core detection land.
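For illustration, here is a minimal sketch of what capping the thread count at startup could look like. It assumes the ExecuTorch threadpool extension is linked in and that the `_unsafe_reset_threadpool` API referenced in the review discussion below is available; the header path, namespace, and signature may differ between releases.

```cpp
// Hypothetical sketch: cap ExecuTorch CPU threads to roughly half the reported cores.
// The header path, namespace, and _unsafe_reset_threadpool signature are assumptions
// and may differ across ExecuTorch releases.
#include <algorithm>
#include <cstdint>
#include <thread>

#include <executorch/extension/threadpool/threadpool.h>

void cap_inference_threads() {
  const uint32_t cores = std::thread::hardware_concurrency();  // may return 0
  const uint32_t num_threads = std::max<uint32_t>(1, cores / 2);

  // Call once at startup, before any inference runs and while no other thread
  // is using the threadpool (hence "unsafe").
  ::executorch::extension::threadpool::get_threadpool()->_unsafe_reset_threadpool(
      num_threads);
}
```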

### Erroa setting input: 0x10 / Attempted to resize a bounded tensor...
Contributor

typo


### Duplicate Kernel Registration Abort

This manifests as a crash with a call stack that includes ExecuTorch kernel registration and fails with an `et_pal_abort`. It typically means there are multiple `gen_operators_lib` targets linked into the application. There must be only one generated lib per target, though each model can have its own `gen_selected_ops`/`generate_bindings_for_kernels` call.
Contributor

generated lib -> generated operator library


### Performance Troubleshooting

Ensure the model is delegated. If not targeting a specific accelerator, use the XNNPACK delegate for CPU performance. Undelegated operators will typically fall back to the ExecuTorch portable library, which is designed as a platform-independent fallback, and is not optimized for specific hardware.
Contributor

ExecuTorch portable library, which is designed as a platform-independent fallback, and is not optimized for specific hardware.

which is designed to serve as a reference implementation/fallback and is not intended to be used in performance-sensitive production scenarios.



Additionally, thread counts are a common source of performance issues. While we are working to improve the default behavior, ExecuTorch will currently use as many threads as there are cores. On some heterogeneous mobile SoCs, this can be slow. Consider setting the thread count to cores / 2, or simply to 4. This will lead to a speedup (or maintain parity) on almost all mobile devices.
Contributor

This will lead to a speedup (or maintain parity) on almost all mobile devices.

This might lead to a speedup?

Because if it always will, why isn't this the default?

Contributor

Also, I would probably add a reference to a function or other document that explains how CPU parallelism can be configured.

Contributor

There is no way to do this in OSS at the moment except for the unsafe API.

Labels: CLA Signed, release notes: misc

5 participants