Add Ray to Cluster computing (!1170) · Merge requests · Vinta Chen / awesome-python

Merged Administrator requested to merge github/fork/jmargeta/patch-1 into master Oct 22, 2018

Created by: jmargeta

What is this Python project?

Ray is a flexible, high-performance distributed execution framework.

Achieves parallelism in Python with simple and consistent API.
Particularly suited for machine learning (agnostic to the machine learning library of choice)
Forms the base of Ray's own distributed libraries for deep and reinforcement learning, processing of Pandas dataframes, and hyper parameter search.
Uses Plasma as its object store which allows to efficiently share large numpy arrays (or objects serializable with Apache Arrow) between the processes, avoiding unnecessary data copies and with only minimal deserialization

Less overhead and lower latency with bottom up scheduling than similar projects
Actors - allow sharing mutable state between tasks
Similar to Dask, for more details see a comparison here: https://github.com/ray-project/ray/issues/642

Anyone who agrees with this pull request could vote for it by adding a 👍 to it, and usually, the maintainer will merge it when votes reach 20.