PyTorch Monarch icon
PyTorch Monarch icon

PyTorch Monarch

Distributed framework for scalable messaging using actor meshes, featuring efficient remote communication, supervision trees for fault tolerance, point-to-point RDMA GPU or CPU memory transfers, imperative Python API for actor and process creation, and distributed tensor support.

PyTorch Monarch screenshot 1

Cost / License

  • Free
  • Open Source

Platforms

  • Mac
  • Linux
  • Python
  • PyTorch
PyTorch Monarch screenshot 1
-
No reviews
0likes
0comments
0alternatives

Features

Suggest and vote on features

Properties

  1.  Scalable

Features

  1.  Error Logging
  2.  Python API
  3.  Python-based
  4.  Distributed
  5.  Fault tolerant
  6.  Support for NVIDIA CUDA acceleration
  7.  Unit Testing

 Tags

  • remote-actor
  • cluster
  • fault-tolerance
  • Software Framework
  • mesh
  • scalable-messaging-apis
  • python-framework
  • pytorch
  • actors
  • rust-api
  • rust-based

PyTorch Monarch News & Activities

Highlights All activities

Recent News

Show more news

Recent activities

Show all activities

PyTorch Monarch information

  • Developed by

    US flagpytorch
  • Licensing

    Open Source (BSD-3-Clause) and Free product.
  • Written in

  • Alternatives

    0 alternatives listed
  • Supported Languages

    • English

AlternativeTo Category

Development

GitHub repository

  •  921 Stars
  •  116 Forks
  •  400 Open Issues
  •   Updated  
View on GitHub
PyTorch Monarch was added to AlternativeTo by Paul on and this page was last updated .
No comments or reviews, maybe you want to be first?
Post comment/review

What is PyTorch Monarch?

Monarch is a distributed programming framework for PyTorch based on scalable actor messaging. It provides:

  • Remote actors with scalable messaging: Actors are grouped into collections called meshes and messages can be broadcast to all members.
  • Fault tolerance through supervision trees: Actors and processes for a tree and failures propagate up the tree, providing good default error behavior and enabling fine-grained fault recovery.
  • Point-to-point RDMA transfers: cheap registration of any GPU or CPU memory in a process, with the one-sided tranfers based on libibverbs
  • Distributed tensors: actors can work with tensor objects sharded across processes

Monarch code imperatively describes how to create processes and actors using a simple python API.

Monarch is currently only supported on Linux systems.

Official Links