Skip to content

Community version of Swarm learning - v2.1.0

Compare
Choose a tag to compare
@suresh-ls suresh-ls released this 11 Oct 16:37
· 176 commits to master since this release
9cbb0bd

In this release, we have delivered key enhancements on SLM-UI (model training metrics, easy browse through of ML logs and centralized swarm log collector), that will significantly enhance user experience. For advanced Swarm Learning users, we have provided couple of additional options for merge algorithms. These will help optimize on training convergence for different customer workloads. Further, we have enabled persistence for blockchain data, which will benefit customers with offline analysis of training related data, faster restart of Swarm network (SN).

One can download product bits and documentation, from My HPE Software Center (https://myenterpriselicense.hpe.com/cwp-ui/auth/login),

Here are the key contents of this release :

Features:
• Persistent data in SN
o Make the SN blockchain persist on disk

• UI/UX Features
o Model training metrics – Accuracy, Loss etc. at SL node and global Swarm level
o Browse through ML container logs
o Centralized Swarm log collector for faster diagnostic collection
o Seamless Product upgrade

• New merge methods for Swarm merge process
o Co-ordinate Median, Geometric Median
o Configurable merge through I/O or Memory optimized modes

• Swarm on Podman (alternative for Docker)
o Support Podman container runtime
o Run Swarm containers with rootless privileges
o Added support for SELinux with Podman on RHEL

• Enhanced diagnostics for SWOP and SN
• Containerized License Server (APLS)
• Documentation and example updates

Defect fixes:
• Defect fixes in SN restart path
• Corrected ‘LIST NODES’ to display only active nodes
• Swarm components exits with proper diagnostics if certificates are expired
• Swarm Learning Topology updated to reflect active nodes
• Reverse proxy updates to consider the port number along with service name