Enable strongly typed ops for deployment #13230
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13230
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 New Failures, 1 Unrelated Failure
As of commit ea124be with merge base 0e76a97:
NEW FAILURES - The following jobs have failed:
BROKEN TRUNK - The following job failed but was present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D79867630
#include <executorch/backends/cadence/hifi/kernels/kernels.h>
#include <executorch/runtime/kernel/kernel_includes.h>

namespace cadence {
Nested namespaces with `::` are preferred since C++17; see https://clang.llvm.org/extra/clang-tidy/checks/modernize/concat-nested-namespaces.html
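For reference, the two spellings the review comment contrasts look roughly like this. This is only an illustrative sketch: `cadence` comes from the snippet under review, while `impl`, `legacy`, `modern`, and `level_count` are hypothetical names, not the namespaces this PR uses.

```cpp
// Pre-C++17 spelling: one block (and one closing brace) per nesting level.
namespace cadence {
namespace impl {
namespace legacy {
inline int level_count() { return 3; }
} // namespace legacy
} // namespace impl
} // namespace cadence

// C++17 nested namespace definition: the same nesting in a single block.
// This is the form modernize-concat-nested-namespaces suggests.
namespace cadence::impl::modern {
inline int level_count() { return 3; }
} // namespace cadence::impl::modern
```

Both forms declare identical scopes, so the change is purely syntactic and requires compiling as C++17 or later.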
Summary:
Pull Request resolved: #13230
As titled. This allows a significant reduction in code size by using only the bare minimum of C++ code. Right now, this diff implements that option only for quantized fully connected per tensor.
This diff introduces opt level 4 for deployment purposes. The idea is similar to -Os in most compilers; we use 4 for simplicity and because, for now, everything should be inclusive by construction. This decision can be revisited later!
Differential Revision: D79867630
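One way to picture why a strongly typed op can shrink the binary is the sketch below. It is not the code from this PR. Every name in it (`DType`, `dot_per_tensor`, `fully_connected_generic`, `fully_connected_int8`) is hypothetical, and requantization of the accumulator is left out to keep it short. The generic entry point switches on a runtime dtype, so every template instantiation has to be compiled and linked; the typed entry point references only the int8 instantiation.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical dtype tag standing in for a runtime tensor type.
enum class DType { Int8, UInt8, Int16 };

// Shared arithmetic for one output element with per-tensor zero points:
// acc = bias + sum((in[i] - in_zp) * (w[i] - w_zp)).
template <typename T>
int32_t dot_per_tensor(const T* in, const T* w, std::size_t n,
                       int32_t in_zp, int32_t w_zp, int32_t bias) {
  int32_t acc = bias;
  for (std::size_t i = 0; i < n; ++i) {
    acc += (static_cast<int32_t>(in[i]) - in_zp) *
           (static_cast<int32_t>(w[i]) - w_zp);
  }
  return acc;
}

// Generic entry point: the runtime switch references all three
// instantiations, so all of them end up in the binary.
int32_t fully_connected_generic(DType dtype, const void* in, const void* w,
                                std::size_t n, int32_t in_zp, int32_t w_zp,
                                int32_t bias) {
  switch (dtype) {
    case DType::Int8:
      return dot_per_tensor(static_cast<const int8_t*>(in),
                            static_cast<const int8_t*>(w), n, in_zp, w_zp, bias);
    case DType::UInt8:
      return dot_per_tensor(static_cast<const uint8_t*>(in),
                            static_cast<const uint8_t*>(w), n, in_zp, w_zp, bias);
    case DType::Int16:
      return dot_per_tensor(static_cast<const int16_t*>(in),
                            static_cast<const int16_t*>(w), n, in_zp, w_zp, bias);
  }
  return 0;
}

// Strongly typed entry point: the dtype is fixed in the signature, so only
// the int8 instantiation is needed when this is the sole op that is linked.
int32_t fully_connected_int8(const int8_t* in, const int8_t* w, std::size_t n,
                             int32_t in_zp, int32_t w_zp, int32_t bias) {
  return dot_per_tensor(in, w, n, in_zp, w_zp, bias);
}
```

If the exported graph calls only the typed variant, the linker can drop the generic dispatcher and the unused instantiations, which is the kind of saving the summary describes.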
4bd5b3b to 007022b
007022b to 368ca27
368ca27 to a7f91c5
a7f91c5 to 9fba6c6
Reviewed By: zonglinpeng
9fba6c6 to ea124be