Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize IMathEngine::TransposeMatrix #655

Merged
merged 3 commits into from
Jun 16, 2022

Conversation

FedyuninV
Copy link
Contributor

No description provided.

Valeriy Fedyunin added 3 commits June 15, 2022 19:46
Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>
Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>
Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>
@FedyuninV FedyuninV changed the title Optimize math engine transpose Optimize IMathEngine::TransposeMatrix Jun 15, 2022
@SAngeliuk SAngeliuk merged commit 662b1a9 into neoml-lib:master Jun 16, 2022
@FedyuninV FedyuninV deleted the OptimizeMathEngineTranspose branch June 16, 2022 14:46
black-boomerang pushed a commit to black-boomerang/neoml that referenced this pull request Jun 16, 2022
* Copy data instead of transpose (where possible)

Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>

* Optimize CPU transpose even further

Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>

* Reduce diff size

Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>
Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>
FedyuninV pushed a commit that referenced this pull request Aug 12, 2022
* Added mask type parameter and rebuild method

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Fix infinite growth of CBaseLayer::blobCache (#652)

* Fix infinite growth of blobCache

Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>

* Fix iOS build

Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>
Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Build Framework (NeoML-master 2.0.70.0): Incrementing version number.

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Optimize IMathEngine::TransposeMatrix (#655)

* Copy data instead of transpose (where possible)

Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>

* Optimize CPU transpose even further

Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>

* Reduce diff size

Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>
Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Updated mask type serialization

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Added TransformerSourceMaskLayer

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Fixed bugs

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Added serialization file

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Added serialization test for CTransformerSourceMaskLayer

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Updated serialization file

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Updated CTransformerSourceMaskLayer

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Added TransformerSourceMaskTest to CMakeLists.txt

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

* Fixed formatting

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>

Signed-off-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>
Signed-off-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>
Co-authored-by: Aleksei Strashnov <aleksei.strashnov@abbyy.com>
Co-authored-by: Valeriy Fedyunin <valery.fedyunin@abbyy.com>
Co-authored-by: BuildTech <BuildTech@abbyy.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants