Skip to content

Latest commit

 

History

History
285 lines (196 loc) · 8.24 KB

CHANGELOG.md

File metadata and controls

285 lines (196 loc) · 8.24 KB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[0.9.0a0] - 2024-11-XX

Added

  • Graph Transformer processor for GraphCast/GenCast.
  • Utility to generate STL from Signed Distance Field.
  • Metrics for CAE and CFD domain such as integrals, drag, and turbulence invariances and spectrum.
  • Added gradient clipping to StaticCapture utilities.
  • Bistride Multiscale MeshGraphNet example.
  • FIGConvUNet model and example.
  • The Transolver model.
  • The XAeroNet model.
  • Incoporated CorrDiff-GEFS-HRRR model into CorrDiff, with lead-time aware SongUNet and cross entropy loss.

Changed

  • Refactored CorrDiff training recipe for improved usability
  • Fixed timezone calculation in datapipe cosine zenith utility.
  • Refactored EDMPrecondSRV2 preconditioner and fixed the bug related to the metadata
  • Extended the checkpointing utility to store metadata.
  • Corrected missing export of loggin function used by transolver model

Deprecated

Removed

Fixed

Security

Dependencies

[0.8.0] - 2024-09-24

Added

  • Graph Transformer processor for GraphCast/GenCast.
  • Utility to generate STL from Signed Distance Field.
  • Metrics for CAE and CFD domain such as integrals, drag, and turbulence invariances and spectrum.
  • Added gradient clipping to StaticCapture utilities.
  • Bistride Multiscale MeshGraphNet example.

Changed

  • Refactored CorrDiff training recipe for improved usability
  • Fixed timezone calculation in datapipe cosine zenith utility.

[0.7.0] - 2024-07-23

Added

  • Code logging for CorrDiff via Wandb.
  • Augmentation pipeline for CorrDiff.
  • Regression output as additional conditioning for CorrDiff.
  • Learnable positional embedding for CorrDiff.
  • Support for patch-based CorrDiff training and generation (stochastic sampling only)
  • Enable CorrDiff multi-gpu generation
  • Diffusion model for fluid data super-resolution (CMU contribution).
  • The Virtual Foundry GraphNet.
  • A synthetic dataloader for global weather prediction models, demonstrated on GraphCast.
  • Sorted Empirical CDF CRPS algorithm
  • Support for history, cos zenith, and downscaling/upscaling in the ERA5 HDF5 dataloader.
  • An example showing how to train a "tensor-parallel" version of GraphCast on a Shallow-Water-Equation example.
  • 3D UNet
  • AeroGraphNet example of training of MeshGraphNet on Ahmed body and DrivAerNet datasets.
  • Warp SDF routine
  • DLWP HEALPix model
  • Pangu Weather model
  • Fengwu model
  • SwinRNN model
  • Modulated AFNO model

Changed

  • Raise ModulusUndefinedGroupError when querying undefined process groups
  • Changed Indexing error in examples/cfd/swe_nonlinear_pino for modulus loss function
  • Safeguarding against uninitialized usage of DistributedManager

Removed

  • Remove mlflow from deployment image

Fixed

  • Fixed bug in the partitioning logic for distributing graph structures intended for distributed message-passing.
  • Fixed bugs for corrdiff diffusion training of EDMv1 and EDMv2
  • Fixed bug when trying to save DDP model trained through unified recipe

Dependencies

  • Update DALI to CUDA 12 compatible version.
  • Update minimum python version to 3.10

[0.6.0] - 2024-04-17

Added

  • The citation file.
  • Link to the CWA dataset.
  • ClimateDatapipe: an improved datapipe for HDF5/NetCDF4 formatted climate data
  • Performance optimizations to CorrDiff.
  • Physics-Informed Nonlinear Shallow Water Equations example.
  • Warp neighbor search routine with a minimal example.
  • Strict option for loading Modulus checkpoints.
  • Regression only or diffusion only inference for CorrDiff.
  • Support for organization level model files on NGC file system
  • Physics-Informed Magnetohydrodynamics example.

Changed

  • Updated Ahmed Body and Vortex Shedding examples to use Hydra config.
  • Added more config options to FCN AFNO example.
  • Moved posiitonal embedding in CorrDiff from the dataloader to network architecture

Deprecated

  • modulus.models.diffusion.preconditioning.EDMPrecondSR. Use EDMPecondSRV2 instead.

Removed

  • Pickle dependency for CorrDiff.

Fixed

  • Consistent handling of single GPU runs in DistributedManager
  • Output location of objects downloaded with NGC file system
  • Bug in scaling the conditional input in CorrDiff deterministic sampler

Dependencies

  • Updated DGL build in Dockerfile
  • Updated default base image
  • Moved Onnx from optional to required dependencies
  • Optional Makani dependency required for SFNO model.

[0.5.0] - 2024-01-25

Added

  • Distributed process group configuration mechanism.
  • DistributedManager utility to instantiate process groups based on a process group config.
  • Helper functions to faciliate distributed training with shared parameters.
  • Brain anomaly detection example.
  • Updated Frechet Inception Distance to use Wasserstein 2-norm with improved stability.
  • Molecular Dynamics example.
  • Improved usage of GraphPartition, added more flexible ways of defining a partitioned graph.
  • Physics-Informed Stokes Flow example.
  • Profiling markers, benchmarking and performance optimizations for CorrDiff inference.
  • Unified weather model training example.

Changed

  • MLFLow logging such that only proc 0 logs to MLFlow.
  • FNO given seperate methods for constructing lift and spectral encoder layers.

Removed

  • The experimental SFNO

Dependencies

  • Removed experimental SFNO dependencies
  • Added CorrDiff dependencies (cftime, einops, pyspng, nvtx)
  • Made tqdm a required dependency

[0.4.0] - 2023-11-20

Added

  • Added Stokes flow dataset
  • An experimental version of SFNO to be used in unified training recipe for weather models
  • Added distributed FFT utility.
  • Added ruff as a linting tool.
  • Ported utilities from Modulus Launch to main package.
  • EDM diffusion models and recipes for training and sampling.
  • NGC model registry download integration into package/filesystem.
  • Denoising diffusion tutorial.

Changed

  • The AFNO input argument img_size to inp_shape
  • Integrated the network architecture layers from Modulus-Sym.
  • Updated the SFNO model, and the training and inference recipes.

Fixed

  • Fixed modulus.Module from_checkpoint to work from custom model classes

Dependencies

  • Updated the base container to PyTorch 23.10.
  • Updated examples to use Pydantic v2.

[0.3.0] - 2023-09-21

Added

  • Added ability to compute CRPS(..., dim: int = 0).
  • Added EFI for arbitrary climatological CDF.
  • Added Kernel CRPS implementation (kcrps)
  • Added distributed utilities to create process groups and orthogonal process groups.
  • Added distributed AFNO model implementation.
  • Added distributed utilities for communication of buffers of varying size per rank.
  • Added distributed utilities for message passing across multiple GPUs.
  • Added instructions for docker build on ARM architecture.
  • Added batching support and fix the input time step for the DLWP wrapper.

Changed

  • Updating file system cache location to modulus folder

Fixed

  • Fixed modulus uninstall in CI docker image

Security

  • Handle the tar ball extracts in a safer way.

Dependencies

  • Updated the base container to latest PyTorch 23.07.
  • Update DGL version.
  • Updated require installs for python wheel
  • Added optional dependency list for python wheel

[0.2.1] - 2023-08-08

Fixed

  • Added a workaround fix for the CUDA graphs error in multi-node runs

Security

  • Update certifi package version

[0.2.0] - 2023-08-07

Added

  • Added a CHANGELOG.md
  • Added build support for internal DGL
  • 4D Fourier Neural Operator model
  • Ahmed body dataset
  • Unified Climate Datapipe

Changed

  • DGL install changed from pypi to source
  • Updated SFNO to add support for super resolution, flexible checkpoining, etc.

Fixed

  • Fixed issue with torch-harmonics version locking
  • Fixed the Modulus editable install
  • Fixed AMP bug in static capture

Security

  • Fixed security issues with subprocess and urllib in filesystem.py

Dependencies

  • Updated the base container to latest PyTorch base container which is based on torch 2.0
  • Container now supports CUDA 12, Python 3.10

[0.1.0] - 2023-05-08

Added

  • Initial public release.