[WIP] Various new features: Enhanced JIT Compiler targeted to multiple devices, ONNX Support, et al. #154

hikettei · 2024-05-25T14:23:11Z

Refactors

Reduce code to less than 10000 lines.
Remove cl-waffe2-simd, with keeping AVX/Neon/SLEEF intrinsic supports.
Remove JITCPUTensor instead of adding backends/aten.

AbstractNodes

Changes on base-impl
- Unified and simplified arithmetic operations using !view.
  - cl-waffe2 uses broadcast+arithmetic ops instead of removing ScalarAdd/ScalarSub/ScalarMul/ScalarDiv/InverseTensorNode.
- Rename: !inv -> !reciprocal
- Fix typo: !leaky-relu
keepdims option for !max/!min etc

Command Line Tool

Uses Fiveam -> Rove as a testing tool.
Make all tests hardware-independent
Added a command line tool at ./roswell/waffe2.ros
From ONNX to C Interpreter mode (waffe2 codegen xxx)
- It generates the minimum C interpreter (calling CUDA/Metal functions), minimizing the dependencies.

ShapeTracker

[Ehhancement] [ ] to express ScalarTensor, instead of using [scal] and out-scalar-p=t.

Frontend

Implement from onnx to cl-waffe2 IR mode.

Backend

Ready for implementing AbstractTensor backend
- Fast Conv2d kernel (reduces memory-latency, winograd)
- Ready for implementing GPU Supports, including Triton, Metal, CUDA, etc.
Remove cl-waffe2-simd
Remove obsolete SIMD dependencies
Merge the aten-runtime branch.

APIs

Implement AbstractListTensor
- It basically behaves (Shape) Tensor.
- but when called with forward, it wraps λ function w/ loop.
- When called with !concatenate, it creates (LazyXXX ~ Shape) where LazyXXX = (length x) tensor
- It enables implementing kv-cache for transformer models.
Control-FLow
- IfNode IfNode ~ EndIfNode
- LoopNode LoopNode ~ EndLoopNode
- POV: Is wf2IR turing-complete?

…ithmetic operations across Scalar and matrix

…o rove.

…ication model training test.

…rds the acc

[Refactor] Added Command Line Tool, Hardware-independent unittest, unified arithmetic operations, updated workflow, et al.

…ng id2table refinements due to cName refactors.

…ompilation-mode

[Experimental] Enabled the use of the enhanced JIT Compiler (AbstractTensor.lisp)

… when aten is given as a list.

…pport, and new onnx ops

[WIP] From ONNX Mode

hikettei · 2024-06-11T04:21:05Z

I will merge the changes once as it is unlikely to get much development time for a while T_T. (plus, the IR specifications I have formulated are so bad that it is difficult to add new features. I need to take enough time and redevelop the entire back-end from scratch. but is it worth it?...)

hikettei and others added 24 commits May 24, 2024 13:36

[Refactor] Renamed inv -> reciprocal and unified the definition of ar…

8ca9c14

…ithmetic operations across Scalar and matrix

[Refactor] Fixed typos: leaky_relu, etc.

1044cea

[Refactor] Removed scalarXXX impls

8a87f92

fix typo: inv -> reciprocal

5a604dd

refactor: arithmetic ops, hw-independent tests

e2c50e0

[refactor] fixed typo in in-place

7d06b41

fixed issues related to broadcasting and reshaping/bviewing

021dcfe

passing all tests

379c32d

[Refactor] Updated Command Line developing tools

6bbad2e

[Refator] Make all tests hardware independent, and uses from fiveam t…

3478d09

…o rove.

[Refactor] Updated the documentation

67d8070

updated the badge

7fd3e64

[Refactor] Added demonstration calling from command line tool

541819c

[Enhancement] Added an ansi-colored timestamp and info system

b58cc79

[Workflow] Uses different cases for different hardware, added classif…

42bcaaf

…ication model training test.

Merge branch 'develop', remote-tracking branch 'origin' into refactor

32bb564

[Workflow] Fixed indentation

ed3ce05

[Enhancement] added --config options, mlp generates a file which reco…

60b62f5

…rds the acc

[Workflow] Installing SIMD Extension, checking accuracy, et al.

f2cb2a3

[Workflow] Added sudo

eb94aba

[Workflow] Removed Installing Extension step

8b1e609

[Workflow] epoch-num=2

3f252df

Update test_on_push.yml

55121d7

Merge pull request #153 from hikettei/refactor

054524f

[Refactor] Added Command Line Tool, Hardware-independent unittest, unified arithmetic operations, updated workflow, et al.

hikettei changed the title ~~[WIP] Refactor~~ [WIP] Refactoring May 25, 2024

hikettei added 5 commits May 29, 2024 17:15

[Feature] Get ready for implementing aten backend.

1ebff72

[Enhancement] Implemented a baseline for implementing the aten backend

85230ba

[Add] Unary ops

5c93432

[Enhancement] Parameterized Backend Configuration using symbol-macro

3b02eae

[Update] Aten backends require first call (Aten[Backend] ...) macro.

c4d8537

hikettei and others added 9 commits June 1, 2024 15:59

Add: nn.lisp

19a2ad9

[Refactor] Purged JITCPUTensor backend.

863a1d2

[Enhancement] Supports for configurated runtime declaration

90c001e

Tweaked the example for mnist demonstration get worked again, includi…

10d94b1

…ng id2table refinements due to cName refactors.

[Optimization] Reduced the number of invoking gcc; by enabling lazy-c…

0a80373

…ompilation-mode

[BugFix] Stride Error

22acca5

[BugFix] adding initial_offset when using Aten runtime

212d30e

Merge pull request #155 from hikettei/aten-runtime

0d532bb

[Experimental] Enabled the use of the enhanced JIT Compiler (AbstractTensor.lisp)

[Enhancement] Added: lazy-values. build/proceed bundles multiple DAGs…

d7be015

… when aten is given as a list.

hikettei changed the title ~~[WIP] Refactoring~~ [WIP] Various new features: Enhanced JIT Compiler targeted to multiple devices, and ONNX Support, et al. Jun 4, 2024

hikettei changed the title ~~[WIP] Various new features: Enhanced JIT Compiler targeted to multiple devices, and ONNX Support, et al.~~ [WIP] Various new features: Enhanced JIT Compiler targeted to multiple devices, ONNX Support, et al. Jun 4, 2024

hikettei and others added 16 commits June 4, 2024 12:07

[Add] cl-waffe2.frontend

f789cae

[Feature] cl-waffe2.frontend was born

152a283

[Experimental] From ONNX to WFIR Converter

f8599d1

[Feature] Add Gemm Translation Pattern

29ce500

[Enhancement] New ops: Conv

c7674ab

[Enhancement] Lazy Computation for padding

5831de4

[Enhancement] Dynamic shape computation for imXXX ops

48240d4

[Enhancement] Added LazyAssertion

e2301a3

[Enhancement] Various new features including dynamic shaped tensor su…

1f667bb

…pport, and new onnx ops

ready for implementing yolov3

be13af4

[BugFix] Fixed Conv2D Shape Infer

d525492

[Add] Avgpool

a17d9d8

Various changes

da3921c

[WIP] Only the well tested opset converter alives in the code

78a5396

Merge pull request #157 from hikettei/features/from-onnx-mode

38611c5

[WIP] From ONNX Mode

[Workflow] Include Aten[Clang] Testing

06f5891

[Workflow] Export The Qlot path

a3b57b3

hikettei merged commit b9ec982 into master Jun 11, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Various new features: Enhanced JIT Compiler targeted to multiple devices, ONNX Support, et al. #154

[WIP] Various new features: Enhanced JIT Compiler targeted to multiple devices, ONNX Support, et al. #154

hikettei commented May 25, 2024 •

edited

Loading

hikettei commented Jun 11, 2024

[WIP] Various new features: Enhanced JIT Compiler targeted to multiple devices, ONNX Support, et al. #154

[WIP] Various new features: Enhanced JIT Compiler targeted to multiple devices, ONNX Support, et al. #154

Conversation

hikettei commented May 25, 2024 • edited Loading

Refactors

AbstractNodes

Command Line Tool

ShapeTracker

Frontend

Backend

APIs

hikettei commented Jun 11, 2024

hikettei commented May 25, 2024 •

edited

Loading