Release Notes#
This section includes the release information for PyTorch-Plugin-FL.
v0.1.0#
Note
This is a preview release. The version number shown is a pre-release identifier and may change upon final release. Content in this preview is for reference only and does not constitute a commitment or warranty for the final product.
Added features:
Initial release of PyTorch-Plugin-FL as part of FlagOS.
PrivateUse1-based custom device plugin registering
flagosas a first-class PyTorch device.Automatic FlagGems Triton operator registration for the
flagosbackend.Per-operator configurable backend routing via
backends.confwith environment variable overrides.Multi-platform support: NVIDIA CUDA, MetaX MACA, Huawei Ascend.
Complete device management API: stream, event, RNG, AMP, memory allocator, DeviceGuard.
Lightweight C++ dispatch stub replacing PyTorch’s heavier DispatchStub.
C++ stub-only mode (
FLAGOS_DISABLE_FLAGGEMS_PY=1) for minimal overhead.Distributed training support via
torch_fl.distributed(DDP/FSDP patch).MACA cu-bridge ABI shim for symbol version compatibility.
Ascend NPU support with ACL NN API kernels.
Integration test suite with factory ops, dispatch routing, CPU fallback tracing, and Qwen3 inference/training tests.