Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
-
Updated
Oct 16, 2024 - Go
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
A curated list of dedicated resources and applications
Neural network based solvers for partial differential equations and inverse problems 🌌. Implementation of physics-informed neural networks in pytorch.
Yet another speech toolkit based on Kaldi and PyTorch
This repository provides the Open-CE environment files and version definitions for each Open-CE release.
Code for tutorials and examples
Distributed, mixed-precision training with PyTorch
Multi-GPU training with TensorFlow on Piz Daint
This is a sub-repository in building to create acoustic model in Mandarin speech recognition.
Application of the L2HMC algorithm to simulations in lattice QCD.
Reimplement Deep Cell with Keras and Horovod.
Quick run NLP in many task 快速运行分类、序列标注、匹配、生成等NLP任务的Tensorflow框架 (中文 NLP 支持分布式)
Distributed Training of Bayesian Neural Networks at Scale
Distributed training of digital pathology tissue slide images using SageMaker and Horovod.
Create Horovod cluster easily using Ansible
GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)
Add a description, image, and links to the horovod topic page so that developers can more easily learn about it.
To associate your repository with the horovod topic, visit your repo's landing page and select "manage topics."