Skip to content

Releases: microsoft/ark

ARK v0.5.0

16 Dec 11:22
1762798
Compare
Choose a tag to compare
  • Integrate with MSCCL++
  • Removed dependency on gpudma
  • Add AMD CDNA3 architecture support
  • Support communication for AMD GPUs
  • Optimize OpGraph scheduling
  • Add a multi-GPU Llama2 example

See details from #168.

ARK v0.4.1

16 Nov 06:06
4e428dd
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.4.0...v0.4.1

ARK v0.4.0

14 Nov 09:51
e7bd04d
Compare
Choose a tag to compare
  • Support AMD GPUs (CDNA2, single-GPU only)
  • Add high-performance AllReduce & AllGather algorithms with MSLL
  • Fix major bugs in the scheduler

See details from #137.

ARK v0.3.0

04 Oct 02:00
6cb4b6e
Compare
Choose a tag to compare
  • Enable heuristic model graph optimization
  • Revise Python interfaces
  • Add more operators & support mixed-precision models & support bfloat16
  • Add a Llama2-7B example
  • Fix connection setup bugs for large & distributed models
  • Fix correctness bugs from a few operators
  • Minor scheduler improvements

See details from #113.

ARK v0.2.1

07 Sep 15:26
335da67
Compare
Choose a tag to compare
v0.2.1 (#116)

* Fix a header installation bug
* Check the Python version during build
* Resolve compilation warnings

ARK v0.2.0

05 Sep 08:43
693fe01
Compare
Choose a tag to compare

Timeline

Released Date: Sep. 5th, 2023

Work Items

Model

    • Interface: expose the underlying buffer info to Tensor (#79)

Communication Stack

    • Interface: hide GpuCommSw implementation from the interface (#81)
    • Interface: extend the current interface (#104)

Operators Support

    • Operator: add more operators (#62)
    • Operator: upgrade CUTLASS (#105)

Python

    • Interface: #96

Examples

    • Example: parallel matmul example (#64)

Bug Fix

Documents

CI

    • Code Coverage: add code coverage (#110)
    • Unit Tests: add a unit test pipeline (#88)
    • Unit Tests: #91

ARK v0.1.0

07 Aug 13:11
df98254
Compare
Choose a tag to compare

Features

  • Scheduler
    • The default scheduler
  • Communication
    • A simple software communication stack
  • Operators
    • Tensor
    • Reshape
    • Identity
    • Sharding
    • ReduceSum
    • ReduceMean
    • ReduceMax
    • Layernorm
    • Softmax
    • Transpose
    • Matmul
    • Im2col
    • Scale
    • Relu
    • Gelu
    • Add
    • Mul
    • Send
    • SendDone
    • Recv
    • SendMM
    • RecvMM
    • AllReduce
    • AllGather
  • Examples
    • Tutorials
    • A simple FFN training
    • Transformer inference
    • Megatron inference

New Contributors

  • @wusar made their first contribution in #1

Full Changelog: https://github.com/microsoft/ark/commits/v0.1.0