v0.17.0 (2024-07-12)
- [SDK] Drop Python 3.7 and Support Python 3.11 (#2337 by @tenzen-y)
- [SDK] Upgrade the protobuf version to >=4.21.12,<5 (#2358 by @tenzen-y)
- Drop Kubernetes v1.26, and support Kubernetes v1.29 (#2308 by @tenzen-y)
- Drop Kubernetes v1.25, and Support Kubernetes v1.28 (#2303 by @tenzen-y)
- Remove MXNet examples (#2267 by @tenzen-y)
- Replace gRPC code generation tool from Znly/protoc to Buf (#2344 by @forsaken628)
- Support ARM64 arch for release images (#2315 by @andreyvelich)
- DB: Add environment variable option to skip DB table creationˆ (#2245 by @lkaybob)
- Add environment variable option to set postgres ssl mode (#2266 by @ckcd)
- Upgrade TensorFlow version to v2.16.1 (#2282 by @tenzen-y)
- Upgrade PyTorch version to v2.2.1 (#2279 by @tenzen-y)
- [SDK] Generate Name functionality for creating experiments. (#2272 by @bharathk005)
- [SDK] Add
env
&env_from
in client tune (#2235 by @shipengcheng1230) - [SDK] Add 'algorithm_settings' in client tune (#2227 by @shipengcheng1230)
- [SDK] Raise more human-readable name conflict exception (#2199 by @droctothorpe)
- Remove code generation from release script (#2364 by @andreyvelich)
- [SDK] Fix empty list for env variables and numpy version (#2360 by @andreyvelich)
- Use cache-dependency-path in actions/setup-go for CI workflow (#2355 by @forsaken628)
- Fix TestReconcileBatchJob (#2350 by @forsaken628)
- Fix Scikit-Learn Version for Skopt Tests (#2336 by @andreyvelich)
- [SDK] Fix env per Trial parameter in tune API (#2304 by @andreyvelich)
- Fix: clean up UTs for file metrics collector (#2285 by @Electronic-Waste)
- Fix tensor devices for DARTS Trial (#2273 by @sifa1024)
- Typo fix stale.yaml (#2257 by @tarilabs)
- Fix Optuna Validation for CMA-ES (#2240 by @andreyvelich)
- Replace already closed github.com/golang/mock with go.uber.org/mock (#2357 by @forsaken628)
- Update outdated actions (#2324 by @Mersho)
- Upgrade Go version to v1.22 (#2309 by @tenzen-y)
- CI: Enable parallel mode for the coveralls (#2297 by @tenzen-y)
- Upgrade Python version to 3.11 (#2278 by @tenzen-y)
- chore: add unit testcases for files in Text format. (#2274 by @Electronic-Waste)
- Upgrade google/go-containerregistry/pkg/authn/k8schain (#2252 by @tenzen-y)
- Add Technical and style guide to the contribution guide (#2250 by @tenzen-y)
- Install typing-extensions v4.6.3 for Optuna (#2251 by @tenzen-y)
- Remove legacy BO code (#2246 by @andreyvelich)
- Add Changelog for Katib v0.16.0 (#2239 by @andreyvelich)
- Add Katib ROADMAP 2022/2023 (#2153 by @andreyvelich)
- Update Ubuntu to 22.04 for E2E Tests (#2222 by @andreyvelich)
- Run Stale Action Every 5th Hour (#2221 by @andreyvelich)
- Add Stale GitHub Action (#2220 by @andreyvelich)
- Add Changelog for Katib v0.16.0-rc.1 (#2218 by @andreyvelich)
- Add Changelog for Katib v0.16.0-rc.0 (#2204 by @andreyvelich)
- Use the controller-runtime logger in the cert-generator (#2219 by @tenzen-y)
v0.17.0-rc.1 (2024-06-20)
- [SDK] Drop Python 3.7 and Support Python 3.11 (#2337 by @tenzen-y)
- [SDK] Upgrade the protobuf version to >=4.21.12,<5 (#2358 by @tenzen-y)
- Replace gRPC code generation tool from Znly/protoc to Buf (#2344 by @forsaken628)
- Remove code generation from release script (#2364 by @andreyvelich)
- [SDK] Fix empty list for env variables and numpy version (#2360 by @andreyvelich)
- Use cache-dependency-path in actions/setup-go for CI workflow (#2355 by @forsaken628)
- Fix TestReconcileBatchJob (#2350 by @forsaken628)
- Fix Scikit-Learn Version for Skopt Tests (#2336 by @andreyvelich)
- Replace already closed github.com/golang/mock with go.uber.org/mock (#2357 by @forsaken628)
- Update outdated actions (#2324 by @Mersho)
v0.17.0-rc.0 (2024-04-29)
- Drop Kubernetes v1.26, and support Kubernetes v1.29 (#2308 by @tenzen-y)
- Drop Kubernetes v1.25, and Support Kubernetes v1.28 (#2303 by @tenzen-y)
- Support ARM64 arch for release images (#2315 by @andreyvelich)
- DB: Add environment variable option to skip DB table creationˆ (#2245 by @lkaybob)
- Add environment variable option to set postgres ssl mode (#2266 by @ckcd)
- Upgrade TensorFlow version to v2.16.1 (#2282 by @tenzen-y)
- Upgrade PyTorch version to v2.2.1 (#2279 by @tenzen-y)
- [SDK] Generate Name functionality for creating experiments. (#2272 by @bharathk005)
- [SDK] Add
env
&env_from
in client tune (#2235 by @shipengcheng1230) - [SDK] Add 'algorithm_settings' in client tune (#2227 by @shipengcheng1230)
- [SDK] Raise more human-readable name conflict exception (#2199 by @droctothorpe)
- [SDK] Fix env per Trial parameter in tune API (#2304 by @andreyvelich)
- Fix: clean up UTs for file metrics collector (#2285 by @Electronic-Waste)
- Fix tensor devices for DARTS Trial (#2273 by @sifa1024)
- Typo fix stale.yaml (#2257 by @tarilabs)
- Fix Optuna Validation for CMA-ES (#2240 by @andreyvelich)
- Upgrade Go version to v1.22 (#2309 by @tenzen-y)
- CI: Enable parallel mode for the coveralls (#2297 by @tenzen-y)
- Upgrade Python version to 3.11 (#2278 by @tenzen-y)
- chore: add unit testcases for files in Text format. (#2274 by @Electronic-Waste)
- Upgrade google/go-containerregistry/pkg/authn/k8schain (#2252 by @tenzen-y)
- Remove MXNet examples (#2267 by @tenzen-y)
- Add Technical and style guide to the contribution guide (#2250 by @tenzen-y)
- Install typing-extensions v4.6.3 for Optuna (#2251 by @tenzen-y)
- Remove legacy BO code (#2246 by @andreyvelich)
- Add Changelog for Katib v0.16.0 (#2239 by @andreyvelich)
- Add Katib ROADMAP 2022/2023 (#2153 by @andreyvelich)
- Update Ubuntu to 22.04 for E2E Tests (#2222 by @andreyvelich)
- Run Stale Action Every 5th Hour (#2221 by @andreyvelich)
- Add Stale GitHub Action (#2220 by @andreyvelich)
- Add Changelog for Katib v0.16.0-rc.1 (#2218 by @andreyvelich)
- Add Changelog for Katib v0.16.0-rc.0 (#2204 by @andreyvelich)
- Use the controller-runtime logger in the cert-generator (#2219 by @tenzen-y)
v0.16.0 (2023-10-31)
- Implement KatibConfig API (#2176 by @tenzen-y)
- Drop Kubernetes v1.24 and support Kubernetes v1.27 (#2182 by @tenzen-y)
- Drop Kubernetes v1.23 and support Kubernetes v1.26 (#2177 by @tenzen-y)
- Change failurePolicy to Fail for Katib Webhooks (#2018 by @andreyvelich)
- Consolidate the Katib Cert Generator to the Katib Controller (#2185 by @tenzen-y)
- Containerize tests for Katib Conformance (#2146 by @nagar-ajay)
- [UI] Default Resume Policy to never from UI (#2195 by @mChowdhury-91)
- [UI] Remove Deprecated Katib UI (#2179 by @andreyvelich)
- [UI] Fix Trial Logs when Kubernetes Job Fails (#2164 by @andreyvelich)
- kwa(front): Support all namespaces (#2119 by @elenzio9)
- kwa(front): Update the use of SnackBarService (#2113 by @orfeas-k)
- UI: Remove an unsed import, EventV1beta1Api (#2116 by @tenzen-y)
- [SDK] Enable resource specification for trial containers (#2192 by @droctothorpe)
- [SDK] Add namespace parameter to KatibClient (#2183 by @droctothorpe)
- [SDK] Import all Kubernetes Models (#2148 by @andreyvelich)
- Bug: Wait for the certs to be mounted inside the container (#2213 by @tenzen-y)
- Start waiting for certs to be ready before sending data to the channel (#2215 by @tenzen-y)
- E2E: Add additional checks to verify if the components are ready (#2212 by @tenzen-y)
- Remove a katib-webhook-cert Secret from components (#2214 by @tenzen-y)
- Skip to inject the metrics-collector pods to the Katib controller (#2211 by @tenzen-y)
- Sending an empty data to the certsReady channel (#2196 by @tenzen-y)
- Fix conformance docker image (#2147 by @nagar-ajay)
- Add PITS Global Data Recovery Services to the adopters list (#2160 by @ghost)
- Add SDK Breaking Change to Changelog (#2133 by @andreyvelich)
- Add Changelog for Katib v0.15.0 (#2129 by @andreyvelich)
- Add Changelog for Katib v0.15.0-rc.1 (#2123 by @andreyvelich)
- Add Changelog for Katib v0.15.0-rc.0 (#2106 by @andreyvelich)
- Upgrade Tensorflow version to v2.13.0 (#2216 by @tenzen-y)
- Upgrade Go version to v1.20 (#2190 by @tenzen-y)
- Replace grpc_health_probe with the built-in gRPC container probe feature (#2189 by @tenzen-y)
- Allow install binaries for the arm64 in the envtest (#2188 by @tenzen-y)
- Replace action to setup minikube with medyagh/setup-minikube (#2178 by @tenzen-y)
- Remove Charmed Operators for Katib (#2161 by @ca-scribner)
- Namespace and trial pod annotations as CLI argument (#2138 by @nagar-ajay)
- Relax dependencies restriction for the gRPC libraries (#2140 by @tenzen-y)
- Add SDK Breaking Change to Changelog (#2133 by @andreyvelich)
- Increase the free spaces in CI (#2131 by @tenzen-y)
- Reformat katib-operators (#2114 by @tenzen-y)
v0.16.0-rc.1 (2023-08-16)
- Bug: Wait for the certs to be mounted inside the container (#2213 by @tenzen-y)
- Start waiting for certs to be ready before sending data to the channel (#2215 by @tenzen-y)
- E2E: Add additional checks to verify if the components are ready (#2212 by @tenzen-y)
- Remove a katib-webhook-cert Secret from components (#2214 by @tenzen-y)
- Skip to inject the metrics-collector pods to the Katib controller (#2211 by @tenzen-y)
v0.16.0-rc.0 (2023-08-05)
- Implement KatibConfig API (#2176 by @tenzen-y)
- Drop Kubernetes v1.24 and support Kubernetes v1.27 (#2182 by @tenzen-y)
- Drop Kubernetes v1.23 and support Kubernetes v1.26 (#2177 by @tenzen-y)
- Change failurePolicy to Fail for Katib Webhooks (#2018 by @andreyvelich)
- Consolidate the Katib Cert Generator to the Katib Controller (#2185 by @tenzen-y)
- Containerize tests for Katib Conformance (#2146 by @nagar-ajay)
- [UI] Default Resume Policy to never from UI (#2195 by @mChowdhury-91)
- [UI] Remove Deprecated Katib UI (#2179 by @andreyvelich)
- [UI] Fix Trial Logs when Kubernetes Job Fails (#2164 by @andreyvelich)
- kwa(front): Support all namespaces (#2119 by @elenzio9)
- kwa(front): Update the use of SnackBarService (#2113 by @orfeas-k)
- UI: Remove an unsed import, EventV1beta1Api (#2116 by @tenzen-y)
- [SDK] Enable resource specification for trial containers (#2192 by @droctothorpe)
- [SDK] Add namespace parameter to KatibClient (#2183 by @droctothorpe)
- [SDK] Import all Kubernetes Models (#2148 by @andreyvelich)
- Sending an empty data to the certsReady channel (#2196 by @tenzen-y)
- Fix conformance docker image (#2147 by @nagar-ajay)
- Add PITS Global Data Recovery Services to the adopters list (#2160 by @ghost)
- Add SDK Breaking Change to Changelog (#2133 by @andreyvelich)
- Add Changelog for Katib v0.15.0 (#2129 by @andreyvelich)
- Add Changelog for Katib v0.15.0-rc.1 (#2123 by @andreyvelich)
- Add Changelog for Katib v0.15.0-rc.0 (#2106 by @andreyvelich)
- Upgrade Go version to v1.20 (#2190 by @tenzen-y)
- Replace grpc_health_probe with the built-in gRPC container probe feature (#2189 by @tenzen-y)
- Allow install binaries for the arm64 in the envtest (#2188 by @tenzen-y)
- Replace action to setup minikube with medyagh/setup-minikube (#2178 by @tenzen-y)
- Remove Charmed Operators for Katib (#2161 by @ca-scribner)
- Namespace and trial pod annotations as CLI argument (#2138 by @nagar-ajay)
- Relax dependencies restriction for the gRPC libraries (#2140 by @tenzen-y)
- Add SDK Breaking Change to Changelog (#2133 by @andreyvelich)
- Increase the free spaces in CI (#2131 by @tenzen-y)
- Reformat katib-operators (#2114 by @tenzen-y)
v0.15.0 (2023-03-22)
- Use Never Resume Policy as Default (#2102 by @andreyvelich)
- Chocolate Suggestion Service is removed (#2071 by @tenzen-y)
request_number
is removed from the GRPC APIs (#1994 by @johnugeorge)- Enabling Authorization in Katib UI (#1983 and #2041 by @apo-ger)
- The new improved and refactored Katib SDK is not backward compatible (#2075 by @andreyvelich)
- Narrow down Katib RBAC rules (#2091 by @johnugeorge)
- Support Postgres as a Katib DB (#1921 by @anencore94)
- More Suggestion container fields in Katib Config (#2000 by @fischor)
- Katib UI: Create the LOGS tab of Trial's details page (#2117 by @elenzio9)
- Katib UI: Enable pagination/sorting/filtering (#2017 and #2040 by @elenzio9)
- [SDK] Create Tune API in the Katib SDK (#1951 by @andreyvelich)
- [SDK] Get Trial Metrics from Katib DB (#2050 by @andreyvelich)
- Add Conformance Program Doc for AutoML and Training WG (#2048 by @andreyvelich)
- Support for grid search algorithm in Optuna Suggestion Service (#2060 by @tenzen-y)
- Add Trial Labels During Pod Mutation (#2047 by @andreyvelich)
- Support for k8s v1.25 in CI (#1997 by @johnugeorge)
- Add the CI to build multi-platform container images (#1956 by @tenzen-y)
- Drop Kubernetes v1.21 and introduce Kubernetes v1.24 (#1953 by @tenzen-y)
- Add --connect-timeout flag to katib-db-manager (#1937 by @tenzen-y)
- Implement validations for DARTS suggestion service (#1926 by @tenzen-y)
- Implement validation for Optuna suggestion service (#1924 by @tenzen-y)
- Make links in KWA's tables actual links (#2090 by @elenzio9)
- frontend: Rework the trial graph using ECharts in KWA (#2089 by @elenzio9)
- kwa(front): Add UI tests with Cypress (#2088 by @orfeas-k)
- frontend: Enable actions in experiment graph (#2065 by @elenzio9)
- frontend: Show message in case of uncompleted trial instead of the graph (#2063 by @elenzio9)
- frontend: Add source maps in the browser (#2043 by @elenzio9)
- Backend for getting logs of a trial (#2039 by @d-gol)
- frontend: Show the successful trials in the experiment graph (#2013) (#2033 by @elenzio9)
- frontend: Migrate from tslint to eslint in KWA (#2042 by @elenzio9)
- Dedicated yaml tab for Trials (#2034 by @elenzio9)
- KWA: Use new Editor component (Monaco) (#2023 by @orfeas-k)
- kwa(build): Introduce COMMIT file for building KWA (#2014 by @orfeas-k)
- frontend: Fix 500 error after detail page refresh (#1967) (#2001 by @elenzio9)
- Introduce KWA's frontend component for kfp links (#1991 by @elenzio9)
- UI: Rename and right align the age column (#1989 by @elenzio9)
- Show the trials table's status column first (#1990 by @elenzio9)
- UI: Make KWA's main table responsive and add toolbar (#1982 by @elenzio9)
- UI: Fix unit tests (#1977 by @elenzio9)
- UI: Format code (#1979 by @orfeas-k)
- Recreate the Experiments Parallel Coordinates Graph (#1974 by @elenzio9)
- Improve UI API/controller logging to ease troubleshooting (#1966 by @lukeogg)
- [SDK] Use Katib SDK for E2E Tests (#2075 by @andreyvelich)
- [SDK] Use Katib Client without Kube Config (#2098 by @andreyvelich)
- [SDK] Fix namespace parameter in tune API (#1981 by @andreyvelich)
- [SDK] Remove Final Keyword from constants (#1980 by @andreyvelich)
- Fix Release Script for Updating SDK Version (#2104 by @andreyvelich)
- [Fix] add early stopped trials in converter (#2004 by @shaowei-su)
- [bugfix] Fix value passing bug in New Experiment form (#2027 by @orfeas-k)
- Fix main process retrieve logic for early stopping (#1988 by @shaowei-su)
- [hotfix]: filter by name of experiment (#1920 by @anencore94)
- Fix push script to include new images (#1911 by @johnugeorge)
- fix: only validate Kubernetes Job (#2025 by @zhixian82)
- Upgrade grpc-health-probe version to fix some security issues (#2093 by @tenzen-y)
- Format Katib Charm Operator (#2115 by @tenzen-y)
- Add CERN to adopters (#2010 by @d-gol)
- Add More Katib Presentations 2022 (#2009 by @andreyvelich)
- Add the documentation for simple-pbt (#1978 by @tenzen-y)
- Add the license to pbt (#1958 by @tenzen-y)
- Update the Katib version in docs (#1950 by @tenzen-y)
- Update CHANGELOG for v0.14.0 release (#1932 by @johnugeorge)
- Update Training operator Image in CI (#2103 by @johnugeorge)
- Upgrade Go libraries to resolve security issues (#2094 by @tenzen-y)
- Run e2e with various Python versions to verify Python SDK (#2092 by @tenzen-y)
- Add a --prefer-binary flag to 'pip install' command (#2096 by @tenzen-y)
- Upgrade PyTorch version to v1.13.0 (#2082 by @tenzen-y)
- Upgrade Tensorflow version (#2079 by @tenzen-y)
- Upgrade Python version to 3.10 (#2057 by @tenzen-y)
- Pin the NumPy version with v1.23.5 in some images (#2070 by @tenzen-y)
- Upgrade the actions-setup-minikube version to v2.7.2 (#2064 by @tenzen-y)
- Remove Certificate Chain from Cert Generator (#2045 by @andreyvelich)
- Add resources to earlystopping container (#2038 by @zhixian82)
- Add scripts to verify generated codes and Go Modules (#1999 by @tenzen-y)
- [Test] Reduce Katib GitHub Action Runs (#2036 by @andreyvelich)
- gh-actions: Extend action to run Frontend Unit tests (#1998 by @orfeas-k)
- [chore] Upgrade docker/metadata-action, actions/checkout, and actions/setup-python version (#1996 by @tenzen-y)
- [chore] Upgrade Go version to v1.19 (#1995 by @tenzen-y)
- Support for arm64 in simple-pbt image (#1948 by @tenzen-y)
- Support arm64 in darts-cnn-cifar10 image (#1947 by @tenzen-y)
- Support for arm64 in enas-cnn-cifar10 image (#1944 by @tenzen-y)
- Support for arm64 in pytorch-mnist image (#1943 by @tenzen-y)
- Support for arm64 in mxnet-mnist image (#1940 by @tenzen-y)
- Use the katib-new-ui for Charmed gh-actions (#1987 by @tenzen-y)
- [feat] health check for katib-controller (#1934 by @anencore94)
- Upgrade Optuna from v2.x.x to v3.0.0 (#1942 by @keisuke-umezawa)
- Add validation webhooks for maxFailedTrialCount and parallelTrialCount (#1936 by @tenzen-y)
- Introduce Automatic platform ARGs (#1935 by @tenzen-y)
- Update training operator image in CI (#1933 by @johnugeorge)
- Update Katib SDK version (#1931 by @johnugeorge)
- [chore] Upgrade Go version to v1.18 (#1925 by @tenzen-y)
- Add the pytorch-mnist with GPU support container image (#1916 by @tenzen-y)
v0.15.0-rc.1 (2023-02-15)
v0.15.0-rc.0 (2023-01-27)
- Use Never Resume Policy as Default (#2102 by @andreyvelich)
- Chocolate Suggestion Service is removed (#2071 by @tenzen-y)
request_number
is removed from the GRPC APIs (#1994 by @johnugeorge)- The new improved and refactored Katib SDK is not backward compatible (#2075 by @andreyvelich)
- Narrow down Katib RBAC rules (#2091 by @johnugeorge)
- Support Postgres as a Katib DB (#1921 by @anencore94)
- More Suggestion container fields in Katib Config (#2000 by @fischor)
- Katib UI: Enable pagination/sorting/filtering (#2017 and #2040 by @elenzio9)
- Katib UI: Add authorization mechanisms (#1983 by @apo-ger)
- [SDK] Create Tune API in the Katib SDK (#1951 by @andreyvelich)
- [SDK] Get Trial Metrics from Katib DB (#2050 by @andreyvelich)
- Add Conformance Program Doc for AutoML and Training WG (#2048 by @andreyvelich)
- Support for grid search algorithm in Optuna Suggestion Service (#2060 by @tenzen-y)
- Add Trial Labels During Pod Mutation (#2047 by @andreyvelich)
- Support for k8s v1.25 in CI (#1997 by @johnugeorge)
- Add the CI to build multi-platform container images (#1956 by @tenzen-y)
- Drop Kubernetes v1.21 and introduce Kubernetes v1.24 (#1953 by @tenzen-y)
- Add --connect-timeout flag to katib-db-manager (#1937 by @tenzen-y)
- Implement validations for DARTS suggestion service (#1926 by @tenzen-y)
- Implement validation for Optuna suggestion service (#1924 by @tenzen-y)
- Make links in KWA's tables actual links (#2090 by @elenzio9)
- frontend: Rework the trial graph using ECharts in KWA (#2089 by @elenzio9)
- kwa(front): Add UI tests with Cypress (#2088 by @orfeas-k)
- Update manifests to enable authorization check mechanisms for Katib UI in Kubeflow mode (#2041 by @apo-ger)
- frontend: Enable actions in experiment graph (#2065 by @elenzio9)
- frontend: Show message in case of uncompleted trial instead of the graph (#2063 by @elenzio9)
- frontend: Add source maps in the browser (#2043 by @elenzio9)
- Backend for getting logs of a trial (#2039 by @d-gol)
- frontend: Show the successful trials in the experiment graph (#2013) (#2033 by @elenzio9)
- frontend: Migrate from tslint to eslint in KWA (#2042 by @elenzio9)
- Dedicated yaml tab for Trials (#2034 by @elenzio9)
- KWA: Use new Editor component (Monaco) (#2023 by @orfeas-k)
- kwa(build): Introduce COMMIT file for building KWA (#2014 by @orfeas-k)
- frontend: Fix 500 error after detail page refresh (#1967) (#2001 by @elenzio9)
- Introduce KWA's frontend component for kfp links (#1991 by @elenzio9)
- UI: Rename and right align the age column (#1989 by @elenzio9)
- Show the trials table's status column first (#1990 by @elenzio9)
- UI: Make KWA's main table responsive and add toolbar (#1982 by @elenzio9)
- UI: Fix unit tests (#1977 by @elenzio9)
- UI: Format code (#1979 by @orfeas-k)
- Recreate the Experiments Parallel Coordinates Graph (#1974 by @elenzio9)
- Improve UI API/controller logging to ease troubleshooting (#1966 by @lukeogg)
- [SDK] Use Katib SDK for E2E Tests (#2075 by @andreyvelich)
- [SDK] Use Katib Client without Kube Config (#2098 by @andreyvelich)
- [SDK] Fix namespace parameter in tune API (#1981 by @andreyvelich)
- [SDK] Remove Final Keyword from constants (#1980 by @andreyvelich)
- Fix Release Script for Updating SDK Version (#2104 by @andreyvelich)
- [Fix] add early stopped trials in converter (#2004 by @shaowei-su)
- [bugfix] Fix value passing bug in New Experiment form (#2027 by @orfeas-k)
- Fix main process retrieve logic for early stopping (#1988 by @shaowei-su)
- [hotfix]: filter by name of experiment (#1920 by @anencore94)
- Fix push script to include new images (#1911 by @johnugeorge)
- fix: only validate Kubernetes Job (#2025 by @zhixian82)
- Upgrade grpc-health-probe version to fix some security issues (#2093 by @tenzen-y)
- Add CERN to adopters (#2010 by @d-gol)
- Add More Katib Presentations 2022 (#2009 by @andreyvelich)
- Add the documentation for simple-pbt (#1978 by @tenzen-y)
- Add the license to pbt (#1958 by @tenzen-y)
- Update the Katib version in docs (#1950 by @tenzen-y)
- Update CHANGELOG for v0.14.0 release (#1932 by @johnugeorge)
- Update Training operator Image in CI (#2103 by @johnugeorge)
- Upgrade Go libraries to resolve security issues (#2094 by @tenzen-y)
- Run e2e with various Python versions to verify Python SDK (#2092 by @tenzen-y)
- Add a --prefer-binary flag to 'pip install' command (#2096 by @tenzen-y)
- Upgrade PyTorch version to v1.13.0 (#2082 by @tenzen-y)
- Upgrade Tensorflow version (#2079 by @tenzen-y)
- Upgrade Python version to 3.10 (#2057 by @tenzen-y)
- Pin the NumPy version with v1.23.5 in some images (#2070 by @tenzen-y)
- Upgrade the actions-setup-minikube version to v2.7.2 (#2064 by @tenzen-y)
- Remove Certificate Chain from Cert Generator (#2045 by @andreyvelich)
- Add resources to earlystopping container (#2038 by @zhixian82)
- Add scripts to verify generated codes and Go Modules (#1999 by @tenzen-y)
- [Test] Reduce Katib GitHub Action Runs (#2036 by @andreyvelich)
- gh-actions: Extend action to run Frontend Unit tests (#1998 by @orfeas-k)
- [chore] Upgrade docker/metadata-action, actions/checkout, and actions/setup-python version (#1996 by @tenzen-y)
- [chore] Upgrade Go version to v1.19 (#1995 by @tenzen-y)
- Support for arm64 in simple-pbt image (#1948 by @tenzen-y)
- Support arm64 in darts-cnn-cifar10 image (#1947 by @tenzen-y)
- Support for arm64 in enas-cnn-cifar10 image (#1944 by @tenzen-y)
- Support for arm64 in pytorch-mnist image (#1943 by @tenzen-y)
- Support for arm64 in mxnet-mnist image (#1940 by @tenzen-y)
- Use the katib-new-ui for Charmed gh-actions (#1987 by @tenzen-y)
- [feat] health check for katib-controller (#1934 by @anencore94)
- Upgrade Optuna from v2.x.x to v3.0.0 (#1942 by @keisuke-umezawa)
- Add validation webhooks for maxFailedTrialCount and parallelTrialCount (#1936 by @tenzen-y)
- Introduce Automatic platform ARGs (#1935 by @tenzen-y)
- Update training operator image in CI (#1933 by @johnugeorge)
- Update Katib SDK version (#1931 by @johnugeorge)
- [chore] Upgrade Go version to v1.18 (#1925 by @tenzen-y)
- Add the pytorch-mnist with GPU support container image (#1916 by @tenzen-y)
v0.14.0 (2022-08-18)
- Population based training (#1833 by @a9p)
- Support JSON format logs in
file-metrics-collector
(#1765 by @tenzen-y) - Include MetricsUnavailable condition to Complete in Trial (#1877 by @tenzen-y)
- Allow running examples on Apple Silicon M1 and fix image build errors for arm64 (#1898 by @tenzen-y)
- Configurable job name and service name for cert generator (#1889 by @shaowei-su)
- Add PBT to experiment creation form (#1909 by @a9p)
- Distinct page for each Trial in the UI (#1783 by @d-gol)
- Add the pytorch-mnist with GPU support container image (#1917 by @tenzen-y)
- Fix push script to include new images (#1912 by @johnugeorge)
- Fixes lint warnings in YAML files (#1902 by @Rishit-dagli)
- Fix errors when running the test on Apple Silicon M1 (#1886 by @tenzen-y)
- Reconcile trial assignments by comparing suggestion and trials being executed (#1831 by @henrysecond1)
- Increate the probes seconds in manifests (#1845 by @haoxins)
- Set upper constraint for Optuna (#1852 by @himkt)
- Don't check if trial's metadata is in spec.parameters (#1848 by @alexeygorobets)
- Fix the FPGA examples documentation (#1841 by @eliaskoromilas)
- Add CyberAgent to adopters (#1894 by @tenzen-y)
- Updating the training operator image in CI (#1910 by @johnugeorge)
- Upgrade Python and Pytorch versions for some examples (#1906 by @tenzen-y)
- Linting for K8s YAML files (#1901 by @Rishit-dagli)
- Change integration test sysytem from KinD Cluster to Minikube Cluster (#1899 by @tenzen-y)
- Upgrade mysql version to v8.0.29 (#1897 by @tenzen-y)
- Upgrade tensorflow-aarch64 version to v2.9.1 (#1891 by @tenzen-y)
- chore: Upgrade Go libraries to resolve some security issues in the katib-controller (#1888 by @tenzen-y)
- Migrate kubeflow-katib-presubmit to GitHub Actions (#1882 by @tenzen-y)
- Add semicolon when using
command
command in Makefile (#1885 by @tenzen-y) - Fix
HAS_SHELLCHECK
andHAS_SETUP_ENVTEST
in Makefile (#1884 by @tenzen-y) - Remove presubmit tests depending on optional-test-infra (#1871 by @aws-kf-ci-bot)
- Upgrade the Tensorflow version to address some security issues (#1870 by @tenzen-y)
- Upgrade the grpc_health_probe version to v0.4.11 to resolve security vulnerability CVE-2022-27191 (#1875 by @tenzen-y)
- additional metric names should not include objective metric name (#1874 by @henrysecond1)
- Upgrade the Kubernetes Python client to 22.6.0 (#1869 by @tenzen-y)
- Upgrade the kubebuilder to v3.2.0 and Kubernetes Go libraries to v1.22.2 (#1861 by @tenzen-y)
- Update FPGA XGBoost example (#1865 by @eliaskoromilas)
- Fix kubeflowkatib/mxnet-mnist image (#1866 by @tenzen-y)
- pins pip and setuptools versions operators to avoid installation issues (#1867 by @DnPlas)
- Add shellcheck (#1857 by @tenzen-y)
- Bump kubeflow-katib and kfp version in notebook examples (#1849 by @tenzen-y)
- Add prometheus scraping and grafana support to charmed katib-controller operator (#1839 by @jardon)
- Upgrade Black to fix linting (#1842 by @jardon)
v0.13.0 (2022-03-04)
- Implement validation for Early Stopping (#1709 by @tenzen-y)
- Change namespace label for Metrics Collector injection (#1740 by @andreyvelich)
- Modify gRPC API with Current Request Number (#1728 by @andreyvelich)
- Allow to remove each resource in Katib config (#1729 by @andreyvelich)
- Support leader election for Katib Controller (#1713 by @tenzen-y)
- Change default Metrics Collect format (#1707 by @anencore94)
- Bump Python version to 3.9 (#1731 by @tenzen-y)
- Update Go version to 1.17 (#1683 by @andreyvelich)
- Create Python script to run e2e Argo Workflow (#1674 by @andreyvelich)
- Reimplement Katib Cert Generator in Go (#1662 by @tenzen-y)
- SDK: change list apis to return objects as default (#1630 by @anencore94)
- Enhance Katib UI feasible space (#1721 by @seong7)
- Handle missing TrialTemplates in Katib UI (#1652 by @kimwnasptd)
- Add Prettier devDependency in Katib UI (#1629 by @seong7)
- Fix a link for GRPC API documentation (#1786 by @tenzen-y)
- Add my presentations that include Katib (#1753 by @terrytangyuan)
- Add Akuity to list of adopters (#1749 by @terrytangyuan)
- Change Argo -> Argo Workflows (#1741 by @terrytangyuan)
- Update Algorithm Service Doc for the new CI script (#1724 by @andreyvelich)
- Update link to Training Operator (#1699 by @terrytangyuan)
- Refactor examples folder structure (#1691 by @andreyvelich)
- Fix README in examples directory (#1687 by @tenzen-y)
- Add Kubeflow MXJob example (#1688 by @andreyvelich)
- Update FPGA examples (#1685 by @eliaskoromilas)
- Refactor README (#1667 by @andreyvelich)
- Change the minimal Kustomize version in the developer guide (#1675 by @tenzen-y)
- Add Katib release process guide (#1641 by @andreyvelich)
- Remove unrecognized keys from metadata.yaml in Charmed operators (#1759 by @DnPlas)
- Fix the default Metrics Collector regex (#1755 by @andreyvelich)
- Fix Status Handling in Charmed Operators (#1743 by @DomFleischmann)
- Fix bug on list type HP in Katib UI (#1704 by @seong7)
- Fix Range for Int and Double values in Grid search (#1732 by @andreyvelich)
- Check if parameter references exist in Experiment parameters (#1726 by @henrysecond1)
- Fix same set for HyperParameters in Bayesian Optimization algorithm (#1701 by @fabianvdW)
- Close MySQL statement and rows resources when SQL exec ends (#1720 by @chenwenjun-github)
- Fix Cluster Role of Katib Controller to access image pull secrets (#1725 by @henrysecond1)
- Emit events when fails to reconcile all Trials (#1706 by @henrysecond1)
- Missing metrics port annotation (#1715 by @alexeykaplin)
- Fix absolute value in Katib UI (#1676 by @anencore94)
- Add missing omitempty parameter to APIs (#1645 by @andreyvelich)
- Reconcile semantics for Suggestion Algorithms (#1633 by @johnugeorge)
- Fix default label for Training Operators (#1813 by @andreyvelich)
- Update supported Python version for Katib SDK (#1798 by @tenzen-y)
- Use release tags for Trial images (#1757 by @andreyvelich)
- Upgrade cert-manager API from v1alpha2 to v1 (#1752 by @haoxins)
- Add Workflow to Publish Katib Images (#1746 by @andreyvelich)
- Update Charmed Katib Operators + CI to 0.12 (#1717 by @knkski)
- Updating Katib CI to use Training Operator (#1710 by @midhun1998)
- Update OWNERS for Charmed operators (#1718 by @ca-scribner)
- Implement some unit tests for the Katib Config package (#1690 by @tenzen-y)
- Add GitHub Actions for Python unit tests (#1677 by @andreyvelich)
- Add OWNERS file for the Katib new UI (#1681 by @kimwnasptd)
- Add envtest to check
reconcileRBAC
(#1678 by @tenzen-y) - Use golangci-lint as linter for Go (#1671 by @tenzen-y)
v0.13.0-rc.1 (2022-02-15)
- Fix default label for Training Operators (#1813 by @andreyvelich)
- Update supported Python version for Katib SDK (#1798 by @tenzen-y)
v0.13.0-rc.0 (2022-01-25)
- Implement validation for Early Stopping (#1709 by @tenzen-y)
- Change namespace label for Metrics Collector injection (#1740 by @andreyvelich)
- Modify gRPC API with Current Request Number (#1728 by @andreyvelich)
- Allow to remove each resource in Katib config (#1729 by @andreyvelich)
- Support leader election for Katib Controller (#1713 by @tenzen-y)
- Change default Metrics Collect format (#1707 by @anencore94)
- Bump Python version to 3.9 (#1731 by @tenzen-y)
- Update Go version to 1.17 (#1683 by @andreyvelich)
- Create Python script to run e2e Argo Workflow (#1674 by @andreyvelich)
- Reimplement Katib Cert Generator in Go (#1662 by @tenzen-y)
- SDK: change list apis to return objects as default (#1630 by @anencore94)
- Enhance Katib UI feasible space (#1721 by @seong7)
- Handle missing TrialTemplates in Katib UI (#1652 by @kimwnasptd)
- Add Prettier devDependency in Katib UI (#1629 by @seong7)
- Fix a link for GRPC API documentation (#1786 by @tenzen-y)
- Add my presentations that include Katib (#1753 by @terrytangyuan)
- Add Akuity to list of adopters (#1749 by @terrytangyuan)
- Change Argo -> Argo Workflows (#1741 by @terrytangyuan)
- Update Algorithm Service Doc for the new CI script (#1724 by @andreyvelich)
- Update link to Training Operator (#1699 by @terrytangyuan)
- Refactor examples folder structure (#1691 by @andreyvelich)
- Fix README in examples directory (#1687 by @tenzen-y)
- Add Kubeflow MXJob example (#1688 by @andreyvelich)
- Update FPGA examples (#1685 by @eliaskoromilas)
- Refactor README (#1667 by @andreyvelich)
- Change the minimal Kustomize version in the developer guide (#1675 by @tenzen-y)
- Add Katib release process guide (#1641 by @andreyvelich)
- Remove unrecognized keys from metadata.yaml in Charmed operators (#1759 by @DnPlas)
- Fix the default Metrics Collector regex (#1755 by @andreyvelich)
- Fix Status Handling in Charmed Operators (#1743 by @DomFleischmann)
- Fix bug on list type HP in Katib UI (#1704 by @seong7)
- Fix Range for Int and Double values in Grid search (#1732 by @andreyvelich)
- Check if parameter references exist in Experiment parameters (#1726 by @henrysecond1)
- Fix same set for HyperParameters in Bayesian Optimization algorithm (#1701 by @fabianvdW)
- Close MySQL statement and rows resources when SQL exec ends (#1720 by @chenwenjun-github)
- Fix Cluster Role of Katib Controller to access image pull secrets (#1725 by @henrysecond1)
- Emit events when fails to reconcile all Trials (#1706 by @henrysecond1)
- Missing metrics port annotation (#1715 by @alexeykaplin)
- Fix absolute value in Katib UI (#1676 by @anencore94)
- Add missing omitempty parameter to APIs (#1645 by @andreyvelich)
- Reconcile semantics for Suggestion Algorithms (#1633 by @johnugeorge)
- Use release tags for Trial images (#1757 by @andreyvelich)
- Upgrade cert-manager API from v1alpha2 to v1 (#1752 by @haoxins)
- Add Workflow to Publish Katib Images (#1746 by @andreyvelich)
- Update Charmed Katib Operators + CI to 0.12 (#1717 by @knkski)
- Updating Katib CI to use Training Operator (#1710 by @midhun1998)
- Update OWNERS for Charmed operators (#1718 by @ca-scribner)
- Implement some unit tests for the Katib Config package (#1690 by @tenzen-y)
- Add GitHub Actions for Python unit tests (#1677 by @andreyvelich)
- Add OWNERS file for the Katib new UI (#1681 by @kimwnasptd)
- Add envtest to check
reconcileRBAC
(#1678 by @tenzen-y) - Use golangci-lint as linter for Go (#1671 by @tenzen-y)
v0.12.0 (2021-10-05)
- Add Optuna based suggestion service (#1613 by @g-votte)
- Support Sobol's Quasirandom Sequence using Goptuna. (#1523 by @c-bata)
- Bump the Goptuna version up to v0.8.0 with IPOP-CMA-ES and BIPOP-CMA-ES support. (#1519 by @c-bata)
- Validate possible operations for Grid suggestion (#1205 by @andreyvelich)
- Validate for Bayesian Optimization algorithm settings (#1600 by @anencore94)
- Add Support for Argo Workflows (#1605 by @andreyvelich)
- Add Support for XGBoost Operator with LightGBM example (#1603 by @andreyvelich)
- Allow empty resources for CPU and Memory in Katib config (#1564 by @andreyvelich)
- Add kustomization overlay: katib-openshift (#1513 by @maanur)
- Switch to SDI in Katib Charm (#1555 by @knkski)
- Add Multivariate TPE to Katib UI (#1625 by @andreyvelich)
- Update Katib UI with Optuna Algorithm Settings (#1626 by @andreyvelich)
- Change the default image for the new Katib UI (#1608 by @andreyvelich)
- Add Katib 2021 ROADMAP (#1524 by @andreyvelich)
- Add AutoML and Training WG Summit July 2021 (#1615 by @andreyvelich)
- Add the new Katib presentations 2021 (#1539 by @andreyvelich)
- Add Doc checklist to PR template (#1568 by @andreyvelich)
- Fix typo in operators/README (#1557 by @evilnick)
- Adds docs on how to use Katib Charm within KF (#1556 by @RFMVasconcelos)
- Fix a link to Kustomize manifest for new Katib UI (#1521 by @c-bata)
- Fix UI for handling missing params (#1657 by @kimwnasptd)
- Reconcile semantics for Suggestion Algorithms (#1644 by @johnugeorge)
- Fix Metrics Collector error in case of non-existing Process (#1614 by @andreyvelich)
- Fix mysql version in docker image (#1594 by @munagekar)
- Fix grep in Tekton Experiment Doc (#1578 by @andreyvelich)
- Error messages corrected (#1522 by @himanshu007-creator)
- Install charmcraft 1.0.0 (#1593 by @DomFleischmann)
- Modify XGBoostJob example for the new Controller (#1623 by @andreyvelich)
- Modify Labels for controller resources (#1621 by @andreyvelich)
- Modify Labels for Katib Components (#1611 by @andreyvelich)
- Upgrade CRDs to apiextensions.k8s.io/v1 (#1610 by @andreyvelich)
- Update Katib SDK with OpenAPI generator (#1572 by @andreyvelich)
- Disable default PV for Experiment with resume from volume (#1552 by @andreyvelich)
- Remove PV from MySQL component (#1527 by @andreyvelich)
- feat: add naming regex check on validating webhook (#1541 by @anencore94)
v0.12.0-rc.1 (2021-09-07)
- Fix UI for handling missing params (#1657 by @kimwnasptd)
- Reconcile semantics for Suggestion Algorithms (#1644 by @johnugeorge)
v0.12.0-rc.0 (2021-08-19)
- Add Optuna based suggestion service (#1613 by @g-votte)
- Support Sobol's Quasirandom Sequence using Goptuna. (#1523 by @c-bata)
- Bump the Goptuna version up to v0.8.0 with IPOP-CMA-ES and BIPOP-CMA-ES support. (#1519 by @c-bata)
- Validate possible operations for Grid suggestion (#1205 by @andreyvelich)
- Validate for Bayesian Optimization algorithm settings (#1600 by @anencore94)
- Add Support for Argo Workflows (#1605 by @andreyvelich)
- Add Support for XGBoost Operator with LightGBM example (#1603 by @andreyvelich)
- Allow empty resources for CPU and Memory in Katib config (#1564 by @andreyvelich)
- Add kustomization overlay: katib-openshift (#1513 by @maanur)
- Switch to SDI in Katib Charm (#1555 by @knkski)
- Add Multivariate TPE to Katib UI (#1625 by @andreyvelich)
- Update Katib UI with Optuna Algorithm Settings (#1626 by @andreyvelich)
- Change the default image for the new Katib UI (#1608 by @andreyvelich)
- Add Katib 2021 ROADMAP (#1524 by @andreyvelich)
- Add AutoML and Training WG Summit July 2021 (#1615 by @andreyvelich)
- Add the new Katib presentations 2021 (#1539 by @andreyvelich)
- Add Doc checklist to PR template (#1568 by @andreyvelich)
- Fix typo in operators/README (#1557 by @evilnick)
- Adds docs on how to use Katib Charm within KF (#1556 by @RFMVasconcelos)
- Fix a link to Kustomize manifest for new Katib UI (#1521 by @c-bata)
- Fix Metrics Collector error in case of non-existing Process (#1614 by @andreyvelich)
- Fix mysql version in docker image (#1594 by @munagekar)
- Fix grep in Tekton Experiment Doc (#1578 by @andreyvelich)
- Error messages corrected (#1522 by @himanshu007-creator)
- Install charmcraft 1.0.0 (#1593 by @DomFleischmann)
- Modify XGBoostJob example for the new Controller (#1623 by @andreyvelich)
- Modify Labels for controller resources (#1621 by @andreyvelich)
- Modify Labels for Katib Components (#1611 by @andreyvelich)
- Upgrade CRDs to apiextensions.k8s.io/v1 (#1610 by @andreyvelich)
- Update Katib SDK with OpenAPI generator (#1572 by @andreyvelich)
- Disable default PV for Experiment with resume from volume (#1552 by @andreyvelich)
- Remove PV from MySQL component (#1527 by @andreyvelich)
- feat: add naming regex check on validating webhook (#1541 by @anencore94)
v0.11.1 (2021-06-09)
- Fix Katib manifest for Kubeflow 1.3 (#1503 by @yanniszark)
- Fix Katib release script (#1510 by @andreyvelich)
- Remove Application CR (#1509 by @yanniszark)
- Modify Katib manifest to support newer Kustomize version (#1515 by @DavidSpek)
v0.11.0 (2021-03-22)
- Disable dynamic Webhook creation (#1450 by @andreyvelich)
- Add the
waitAllProcesses
flag to the Katib config (#1394 by @robbertvdg) - Migrate Katib to Go modules (#1438 by @andreyvelich)
- Update Katib SDK with the
get_success_trial_details
API (#1442 by @Adarsh2910) - Add release process script (#1473 by @andreyvelich)
- Refactor the Katib installation using Kustomize (#1464 by @andreyvelich)
- First step for the Katib new UI implementation (#1427 by @kimwnasptd)
- Add missing fields to the Katib new UI (#1463 by @kimwnasptd)
- Add instructions to install the new Katib UI (#1476 by @kimwnasptd)
- Add Juju operator support for Katib (#1403 by @knkski)
- Add GitHub Actions for the Juju operator (#1407 by @knkski)
- Add install docs for the Juju operator (#1411 by @RFMVasconcelos)
- Modify ClusterRoles for the Juju operator (#1426 by @DomFleischmann)
- Update the Juju operator with the new Katib Webhooks (#1465 by @knkski)
- Fix compare step for Early Stopping (#1386 by @andreyvelich)
- Fix Early Stopping in the Goptuna Suggestion (#1404 by @andreyvelich)
- Fix SDK examples to work with the Katib 0.10 (#1402 by @andreyvelich)
- Fix links in the TFEvent Metrics Collector (#1417 by @zuston)
- Fix the gRPC build script (#1492 by @andreyvelich)
- Modify docs for the Katib 0.10 (#1392 by @andreyvelich)
- Add Katib presentation list (#1446 by @andreyvelich)
- Add Canonical to the Katib Adopters (#1401 by @RFMVasconcelos)
- Update developer guide with the Katib controller flags (#1449 by @annajung)
- Add Fuzhi to the Katib Adopters (#1451 by @Planck0591)
- Fix Katib broken links to the Kubeflow guides (#1477 by @theofpa)
- Add the Katib Webhook docs (#1486 by @andreyvelich)
- Add recreate strategy for the MySQL deployment (#1393 by @andreyvelich)
- Modify worker image for the Katib AWS CI/CD (#1423 by @PatrickXYS)
- Add the SVG logo for Katib (#1414 by @knkski)
- Verify empty Objective in the Experiment defaults (#1445 by @andreyvelich)
- Move the Katib manifests upstream (#1432 by @yanniszark)
- Build the Trial images in the Katib CI (#1457 by @andreyvelich)
- Add script to update the boilerplates (#1491 by @andreyvelich)
v0.10.1 (2021-03-02)
- add adopter (#1451 by @Planck0591)
- Add katib controller flags to developers guide (#1449 by @annajung)
- Enhance katib client by adding get_success_trial_details() (#1442 by @Adarsh2910)
- Add Katib presentations and community information (#1446 by @andreyvelich)
- Verify nil objective in Experiment defaults (#1445 by @andreyvelich)
- Migrate to Go modules (#1438 by @andreyvelich)
- Change roles to clusterroles for operators (#1426 by @DomFleischmann)
- Migrate katib to new test-infra (#1423 by @PatrickXYS)
- Add SVG logo traced from bitmap logo (#1414 by @knkski)
- Invalid example url (#1417 by @zuston)
- Fix SDK examples for 0.10 version (#1402 by @andreyvelich)
- Add Github Actions CI for charm operators (#1407 by @knkski)
- Add Juju install commands to operators README (#1411 by @RFMVasconcelos)
- Fix indentation in the OWNERS file (#1408 by @andreyvelich)
- Bump Prettier to 2.2.0 for the Katib UI (#1409 by @andreyvelich)
- Add Katib Bundle for Juju (#1403 by @knkski)
- Remove duecredit pkg from the Suggestions (#1406 by @andreyvelich)
- Fix Early Stopped Trials in Goptuna Suggestion (#1404 by @andreyvelich)
- Remove v1alpha3 version (#1396 by @andreyvelich)
- Update docs for Katib 0.10 (#1392 by @andreyvelich)
- Adding to ADOPTERS.md (#1401 by @RFMVasconcelos)
- Feature/waitallprocesses config (#1394 by @robbertvdg)
- Add recreate strategy to MySQL deployment (#1393 by @andreyvelich)
- Move Adopters file (#1391 by @andreyvelich)
- Add Stale config to close inactivity issues (#1390 by @andreyvelich)
- Remove new Trial kind doc (#1388 by @andreyvelich)
- Fix compare step for the early stopping (#1386 by @andreyvelich)
v0.10.0 (2020-11-07)
- The new Trial template design (#1208 by @andreyvelich)
- Support custom Kubernetes CRD in the Trial template (#1214 by @andreyvelich)
- Add example for the Tekton
Pipeline
(#1339 by @andreyvelich) - Add example for the Kubeflow
MPIJob
(#1342 by @andreyvelich)
- Add example for the Tekton
- Support early stopping with the Median Stopping Rule (#1344 by @andreyvelich)
- Resume Experiment from the volume (#1275 by @andreyvelich)
- Support volume settings in the Katib config (#1291 by @andreyvelich)
- Extract the Experiment metrics in multiple ways (#1140 by @sperlingxx)
- Update the Python SDK for the v1beta1 version (#1252 by @sperlingxx)
- Show the Trial parameters on the submit Experiment page (#1224 by @andreyvelich)
- Enable to set the Trial template YAML from the submit Experiment page (#1363 by @andreyvelich)
- Optimise the Katib UI image (#1232 by @andreyvelich)
- Enable sorting in the Trial list table (#1251 by @andreyvelich)
- Add pages to the Trial list table (#1262 by @andreyvelich)
- Use the V4 version for the Material UI (#1254 by @andreyvelich)
- Automatically delete an empty ConfigMap with Trial templates (#1260 by @andreyvelich)
- Create a ConfigMap with Trial templates (#1265 by @andreyvelich)
- Support metrics strategies on the submit Experiment page (#1364 by @andreyvelich)
- Add the resume policy to the submit Experiment page (#1362 by @andreyvelich)
- Enable to create an early stopping Experiment from the submit Experiment page (#1373 by @andreyvelich)
- Check the Trials count before deleting it (#1223 by @gaocegege)
- Check that Trials are deleted (#1288 by @andreyvelich)
- Fix the out of range error in the Hyperopt suggestion (#1315 by @andreyvelich)
- Fix the pod ownership to inject the metrics collector (#1303 by @andreyvelich)
- Switch the test infra to the AWS (#1356 by @andreyvelich)
- Use the
docker.io/kubeflowkatib
registry to release images (#1372 by @andreyvelich)
v0.9.0 (2020-06-10)
- add clientset/lister/informer generation (#1194 by @sperlingxx)
- New Trial Template API controller implementation (#1202 by @andreyvelich)
- Add citation information (#1210 by @terrytangyuan)
- Python SDK for katib (#1177 by @prem0912)
- Rename algorithm_setting to algorithm_settings in manager (#1204 by @andreyvelich)
- Update doc for training container images with DARTS (#1201 by @andreyvelich)
- Re: Support string metrics values in Controller (#1200 by @andreyvelich)
- Modify new algorithm service doc (#1198 by @andreyvelich)
- Katib v1beta1 version (#1197 by @andreyvelich)
- Add more algorithm settings to DARTS (#1195 by @andreyvelich)
- Fix additional metrics in TF Event metrics collector (#1191 by @andreyvelich)
- UI: Fix comparison of metric values in Metrics Info Plot (#1192 by @andreyvelich)
- Support one and two NN layers in DARTS (#1185 by @andreyvelich)
- Revert 1176 PR (Support string metric values) (#1189 by @andreyvelich)
- Fix Never Resume Policy for Experiment (#1184 by @andreyvelich)
- Change scikit-learn version to 0.22.0 for BO Suggestion (#1187 by @andreyvelich)
- DARTS documentation (#1180 by @andreyvelich)
- Unittest for DARTS Suggestion (#1179 by @andreyvelich)
- Build image for DARTS Suggestion (#1178 by @andreyvelich)
- DARTS Suggestion (#1175 by @andreyvelich)
- Support string metrics values in Controller (#1176 by @andreyvelich)
- Delete Suggestion deployment after Experiment is finished (#1150 by @sperlingxx)
- Fix Cuda version in training container for ENAS (#1172 by @andreyvelich)
- Rename chocolate algorithm names for consistency (#1164 by @c-bata)
- restructure algorithm configuration for hyperopt_service (#1161 by @sperlingxx)
- Refactor suggestion services folder structure (#1166 by @andreyvelich)
- UI: Suggestion view from Experiment page (#1162 by @andreyvelich)
- Add support Kubeflow operators to ROADMAP (#1145 by @andreyvelich)
- Remove Suggestion Request from Update Suggestion (#1158 by @andreyvelich)
- E2E test for CMA-ES Suggestion (#1157 by @andreyvelich)
- Build Goptuna Suggestion image in CI (#1154 by @andreyvelich)
- Add an example of Goptuna suggestion service (#1155 by @c-bata)
- ENAS enable to add None values in algorithm settings (#1153 by @andreyvelich)
- Support Categorical string values in Chocolate Suggestion (#1149 by @andreyvelich)
- katib ui: adapt environment in which cluster role is unavailable (#1141 by @sperlingxx)
- Add Goptuna based suggestion service for CMA-ES. (#1131 by @c-bata)
- ENAS Check Algorithm Settings in Validate Function (#1146 by @andreyvelich)
- Change folder structure for NAS algorithms, rename NASRL to ENAS (#1143 by @andreyvelich)
- Update ENAS Algorithm Settings in Katib UI (#1142 by @andreyvelich)
- Refactor NAS RL Suggestion (#1134 by @andreyvelich)
- Fix duplicated imports (#1133 by @c-bata)
- Remove Anneal from supported algorithms (#1139 by @c-bata)
- Refactor file-metricscollector (#1137 by @c-bata)
- Fix typo in suggestion packages (#1138 by @c-bata)
- Bump up the Go version to 1.14.2 at Travis CI (#1132 by @c-bata)
- Fix NotImplementedError for TPE and Random suggestion. (#1130 by @c-bata)
- Add ENAS enhancements to ROADMAP (#1129 by @andreyvelich)
- feat: Add 2020 roadmap (#1121 by @gaocegege)
- Optimize Chocolate Suggestion (#1116 by @andreyvelich)
- Support step for int parameter in Chocolate and Hyperopt Suggestion (#1123 by @andreyvelich)
- UI: Remove antd package (#1117 by @andreyvelich)
- Fix Katib e2e tests (#1118 by @andreyvelich)
- UI: Close menu on click (#1114 by @andreyvelich)
- UI: Change style for submit Experiment from YAML (#1113 by @andreyvelich)
- Refactor python files in cmd/suggestion (#1105 by @c-bata)
- Update prow config with the latest folders (#1109 by @andreyvelich)
- Fix logger namespace (#1108 by @c-bata)
- chore(deps): Bump tensorflow from 1.14.0 to 1.15.2 in /cmd/suggestion/nasrl/v1alpha3 (#1035 by @dependabot[bot])
- Refactor suggestion-internal-modules (#1106 by @c-bata)
- chore(deps): Bump psutil from 5.2.2 to 5.6.6 in /cmd/metricscollector/v1alpha3/tfevent-metricscollector (#1085 by @dependabot[bot])
- Fix custom Katib DB Manager env variables (#1102 by @andreyvelich)
- Refactor python files of suggestion services (#1107 by @c-bata)
- Add myself to approvers (#1103 by @andreyvelich)
- Enable to add Service Account Name in Katib config (#1092 by @andreyvelich)
- chore(deps): Bump tensorflow-gpu from 1.15.0 to 1.15.2 in /examples/v1alpha3/NAS-training-containers/RL-cifar10 (#1034 by @dependabot[bot])
- chore(deps): Bump tensorflow from 1.15.0 to 1.15.2 in /examples/v1alpha3/NAS-training-containers/RL-cifar10 (#1036 by @dependabot[bot])
- Add ghalton package to Chocolate Suggestion (#1101 by @andreyvelich)
- Enable to run Experiment without Goal (#1065 by @andreyvelich)
- UI: Support Trial Templates in all namespaces and all configMaps (#1083 by @andreyvelich)
- Fix Chocolate mocmaes algorithm name in Suggestion (#1097 by @andreyvelich)
- UI: Add Metrics Collector Spec to Submit Experiment (#1096 by @andreyvelich)
- UI: Add Experiment view for NAS Jobs (#1077 by @andreyvelich)
- Enable Prettier code formatting for the Katib UI (#1078 by @andreyvelich)
- Adding Karrot as adopter (#1074 by @rky0930)
- fix annotations (#1072 by @sperlingxx)
- Add more unit tests in Katib (#1071 by @andreyvelich)
- dynamic jobProvider and suggestionComposer registration (#1069 by @sperlingxx)
- UI: Update supported algorithms (#1070 by @andreyvelich)
- Fix TPE Suggestion (#1063 by @andreyvelich)
- Update Katib docs (#1066 by @andreyvelich)
- UI: Show best metrics in the Trial metrics information plot (#1068 by @andreyvelich)
- Update adopters (#1064 by @janvdvegt)
- Create Optimizer in BO Suggestion only for the first run (#1057 by @andreyvelich)
- Add missing GRPC health probe for arm64 to db-manager (#1059 by @MrXinWang)
- Change tell method for BO Suggestion (#1055 by @andreyvelich)
- MXNet -> Apache MXNet (#1056 by @terrytangyuan)
- Adding error propagation for K8s client creation in KatibClient (#1053 by @akirillov)
- openAPI generation for katib resources (#1054 by @sperlingxx)
- Disable istio sidecar injection in Suggestion and Training Jobs (#1050 by @andreyvelich)
- UI: The best metrics in Trial table (#1048 by @andreyvelich)
- UI: Fix problem with equal time in different Trial metrics (#1047 by @andreyvelich)
- Adding Babylon Health as adopter (#1046 by @jeremievallee)
- Update adopter (#1038 by @ywskycn)
- UI: Add Trial Status to HP Job Table (#1032 by @andreyvelich)
- UI: Experiment view in the Dialog (#1031 by @andreyvelich)
- record TrialNames group by stages in ExperimentStatus (#1023 by @sperlingxx)
- chore: Update docs (#1024 by @gaocegege)
- fix: Ignore trials without metrics (#1028 by @gaocegege)
- UI: Fix Trial Metric in the Plot (#1027 by @andreyvelich)
- feat: Add a flag to support removing GRPC probe at runtime (#1020 by @gaocegege)
- Adding cisco in Katib adopters (#1026 by @johnugeorge)
- add antfin into list of adoptors (#1025 by @sperlingxx)
- Updated links and instructions for Katib API docs (#1022 by @sarahmaddox)
- feat: Add adopters (#1019 by @gaocegege)
- [FileMetricsCollector]skip line without metrics keywords (#1018 by @sperlingxx)
- Added version number and TODO descriptions to API proto (#1017 by @sarahmaddox)
- fix: First check failed condition (#1015 by @gaocegege)
- feat: Do not inject sh -c when it exists (#1010 by @gaocegege)
- Nerual -> Neural (#1000 by @tmielika)
- [Feature] Enable imagePullPolicy in Katib Config (#1013 by @andreyvelich)
- fix: Avoid out-of-range exception (#1012 by @gaocegege)
- E2E Test for NAS RL Suggestion (#1011 by @andreyvelich)
- Example with collecting timestamp of the metrics (#970 by @andreyvelich)
- Add NAS RL training container to kubeflowkatib repository (#1008 by @andreyvelich)
- Fix number of Trials problem in NAS RL Suggestion (#1009 by @andreyvelich)
- Rename katib DB manager (#1006 by @hougangliu)
- chore(deps): Bump tensorflow from 1.12.0 to 1.15.0 in /examples/v1alpha3/NAS-training-containers/RL-cifar10 (#1005 by @dependabot[bot])
- chore(deps): Bump tensorflow-gpu from 1.12.0 to 1.15.0 in /examples/v1alpha3/NAS-training-containers/RL-cifar10 (#978 by @dependabot[bot])
- CPU example for NAS RL cifar10 training container (#999 by @andreyvelich)
- Updated links to docs/github on Katib dashboard (#1003 by @sarahmaddox)
- Fixed a few typos (#1001 by @sarahmaddox)
- fix: Inherit labels and annotations from experiment (#998 by @gaocegege)
- Moved some content and added links to Kubeflow docs (#990 by @sarahmaddox)
- feat: Support resource in sidecar (#991 by @gaocegege)
- fix: Ignore the failure (#996 by @gaocegege)
- UI: Select namespace from Kubeflow dashboard (#982 by @andreyvelich)
- feat: Add a flag to control the logic about sc (#994 by @gaocegege)
- Initialize securityContext in injected metrics container (#964 by @vpavlin)
- add disk setting into suggestionConfiguration (#989 by @sperlingxx)
- Get dbUser from Env or default('root') (#985 by @UrmsOne)
- feat(experiment_status): Add trial name (#986 by @gaocegege)
- feat(config): Add a new config for webhook (#980 by @gaocegege)
- add metrics for trial (#974 by @yeya24)
- Use port higher than 1024 to be able to run as a non-root user (#960 by @vpavlin)
- Remove redundant serviceAccountName assignment (#969 by @hougangliu)
- Increase Suggestion memory limit (#958 by @andreyvelich)
- User root user explicitely for DB readinessProbe (#962 by @vpavlin)
- Fix typo in getKabitJob function name (#965 by @vpavlin)
- Use port 8080 for Katib UI (#967 by @vpavlin)
- Validate experiment (#957 by @hougangliu)
- UI: Support namespace selection in experiment monitor (#950 by @andreyvelich)
- Delete v1alpha2 api (#953 by @johnugeorge)
- Resume experiment with extra trials from last checkpoint (#952 by @johnugeorge)
- Add a gauge metric for current experiments (#954 by @yeya24)
- feat: Support running (#894 by @gaocegege)
- Use kubeflowkatib repo as image repo of example (#949 by @hougangliu)
- Update API spec for early stopping (#951 by @richardsliu)
- rename counter metrics (#942 by @yeya24)
- update deployment api version (#937 by @yeya24)
- Fix: Empty Trial templates in Katib UI (#938 by @andreyvelich)
- Implement metrics custom filters (#947 by @hougangliu)
- Remove katib webhook when undeploy (#935 by @hougangliu)
- Change web failPolicy to fail instead of default ingore (#933 by @hougangliu)
- feat: Add limit for suggestion pod (#932 by @gaocegege)
- Support multiple metric logs in one line (#925 by @hougangliu)
- Tfevent metriccollector fails when multiple files exist (#920 by @hougangliu)
- Handle metricscollector case worker container have no command (#914 by @hougangliu)
- tfevent-metricscollector support ppc64le (#912 by @hmtai)
- Fix grid suggestion ValidateAlgorithmSettings return (#913 by @hougangliu)
- Fix wrong suggestion service endpoint (#911 by @hougangliu)
- Enable arm64 architecture support for katib images and fix grpc health probe multiarch error. (#897 by @MrXinWang)
- feat: Support custom databases (#910 by @gaocegege)
- Enhance validation for metrics collector (#909 by @hougangliu)
- Support custom metrics collector kind (#908 by @hougangliu)
- support ppc64le (#893 by @hmtai)
- fix: Add Suggestion into CI (#907 by @gaocegege)
- Validate algorithm (#904 by @hougangliu)
- Support restarting training job (#901 by @hougangliu)
- Fix katib-manager crash in kubeflow cluster (#900 by @hougangliu)
- Revert env for katib-db (#899 by @hougangliu)
- feat: Patch to fix running condition (#895 by @gaocegege)
- feat: Add quick start (#878 by @gaocegege)
- Pin operators to 0.7 branch (#885 by @johnugeorge)
- fix: Use 64 instead of 32 since we are using float64 (#883 by @gaocegege)
- fix: Use as instead of , to support python 3 in tfevent metrics collector (#881 by @gaocegege)
- feat: Add event when the reconcile is failed (#879 by @gaocegege)
- feat: Add events in experiment (#880 by @gaocegege)
- Remove unused katib-manager-rest (#876 by @hougangliu)
- feat: Refactor to make it easy to extend new kinds (#865 by @gaocegege)
- feat: Support random state in random search (#873 by @gaocegege)
- Add prometheus metrics for experiment and trial (#870 by @hougangliu)
- fix: Use binary in test (#875 by @gaocegege)
- feat: Support env in mysql (#868 by @gaocegege)
- feat: Add liveness probe for DB (#871 by @gaocegege)
- Remove unused files (#869 by @hougangliu)
- feat: Add doc about algorithm (#867 by @gaocegege)
- feat: Add doc about how to add a new kind in trial (#844 by @gaocegege)
- Adding metric unavailability to events (#864 by @johnugeorge)
- Fix worker error silent (#863 by @hougangliu)
- feat: Show experiment status in json (#853 by @gaocegege)
- Finish reconcile only after running trials are complete (#861 by @johnugeorge)
- Update Readme (#860 by @johnugeorge)
- fix: Fix docs about metrics collection and suggestion design (#858 by @gaocegege)
- Adding events to trials (#852 by @johnugeorge)
- chore: Add dockerignore, enhance liveness for manager (#851 by @gaocegege)
- fix: Reorder to skip observation collection (#847 by @gaocegege)
- feat: Set default namespace and template for trial (#850 by @gaocegege)
- fix: Use namespace to get trial list (#846 by @gaocegege)
- [docs] Add suggestion proposal (#726 by @gaocegege)
- feat: Add doc for implementing new algorithms (#769 by @gaocegege)
- feat: Support namespace in NAS UI (#839 by @gaocegege)
- feat: Show all experiments in monitor (#835 by @gaocegege)
- Delete jobs when trials are completed (#838 by @johnugeorge)
- Remove unused manager message definition (#837 by @hougangliu)
- Add tfjob and pytorch examples to e2e (#820 by @johnugeorge)
- fix: Update liveness probe to avoid problems (#833 by @gaocegege)
- Remove used katib-manager code (#836 by @hougangliu)
- File metrics collector end to end test (#832 by @hougangliu)
- feat: support namespace for trial template (#827 by @gaocegege)
- Remove metrics in DB when delete trial (#830 by @hougangliu)
- Update status conditions during reconcile error (#831 by @johnugeorge)
- feat: Use env var for namespace (#829 by @gaocegege)
- Make sure experiment namespace can inject metriccollector sidecar (#828 by @hougangliu)
- Doc about katib workflow design (#824 by @hougangliu)
- fix: Support multiple namespaces when using kubectl (#826 by @gaocegege)
- feat: Support step when using grid in UI (#821 by @gaocegege)
- fix: Build e2e-runner (#822 by @gaocegege)
- Fix stdout of worker container show nothing (#819 by @hougangliu)
- feat: Remove useless APIs (#818 by @gaocegege)
- feat: Add validation for grid (#812 by @gaocegege)
- Adding additional printer columns for better debugging (#817 by @johnugeorge)
- metrics-collector role is not useful any more (#816 by @hougangliu)
- Rename algorithm deployment and service (#814 by @hougangliu)
- fix: Fix the type (#813 by @gaocegege)
- feat: Add tpe e2e test case (#809 by @gaocegege)
- Remove unused field from Experiment Spec (#806 by @johnugeorge)
- feat: Add HyperBand (#787 by @gaocegege)
- Removing unnecessary config from examples (#803 by @johnugeorge)
- feat: Add NAS RL based algorithm (#793 by @gaocegege)
- fix: Remove copy (#802 by @gaocegege)
- Using example as the default trial (#801 by @johnugeorge)
- Removing metric collector templates from UI (#800 by @johnugeorge)
- fix: Use commitid (#799 by @gaocegege)
- Use common metricsCollector struct (#798 by @hougangliu)
- build: Support arguments (#795 by @gaocegege)
- feat: Rename algorithms (#794 by @gaocegege)
- feat: Add events in suggestion (#796 by @gaocegege)
- UI: Fix problems (#786 by @gaocegege)
- Implement tfevent collector (#792 by @hougangliu)
- Run e2e tests parallel (#790 by @johnugeorge)
- Mark trial as failed when job fails (#791 by @johnugeorge)
- Adding javascripts locally (#789 by @johnugeorge)
- feat: Add grid with the help of chocolate (#780 by @gaocegege)
- feat: Add bayesian (#777 by @gaocegege)
- Implement file metrics collector (#783 by @hougangliu)
- feat: Remove useless algorithms (#782 by @gaocegege)
- Adding algorithm deployment status to Suggestion status (#784 by @johnugeorge)
- Wait for GRPC server to be up (#785 by @johnugeorge)
- feat: Add GRPC health check in suggestions (#779 by @gaocegege)
- feat: Add more output in e2e test for debug purpose and fix test cases (#775 by @gaocegege)
v0.6.0-rc.0 (2019-06-28)
- Add npm build to the UI Dockerfile (#665 by @andreyvelich)
- MetricController: Run only a single job per task (#660 by @epa095)
- Build images for nasrl training container (#669 by @andreyvelich)
- UI: Add delete experiment functionality (#654 by @andreyvelich)
- UI: Change adding a template (#656 by @andreyvelich)
- UI: Select Objective Type from the list (#653 by @andreyvelich)
- Add e2e test to presubmit (#652 by @johnugeorge)
- fix: Do not use webhook in UT (#657 by @gaocegege)
- Enhancing katib client apis (#650 by @johnugeorge)
- Wrong mock file name (#651 by @johnugeorge)
- UI: Show only succeeded Trials (#646 by @andreyvelich)
- v1alpha2 hyperband suggestion service validation (#648 by @hougangliu)
- refactor: Remove requests check for most test cases (#626 by @gaocegege)
- feat(experiment): Delete dup trials (#647 by @gaocegege)
- UI: Add bayesianoptimization algorithm in selectlist (#645 by @andreyvelich)
- Fix v1alpha1 hyperband algorithm mismatch (#634 by @hougangliu)
- v1alpha2 hyperband suggestion service (#631 by @hougangliu)
- Upgrade Job operators to v1 (#635 by @johnugeorge)
- Fix sql syntax for UpdateAlgorithmExtraSettings (#633 by @hougangliu)
- Update Algorithm extra settings during experiment creation (#630 by @johnugeorge)
- Adding cascading delete of pods when jobs are deleted (#632 by @johnugeorge)
- Add tests for grid suggestion algorithm (#628 by @johnugeorge)
- Fixing tag for Suggestion BO (#627 by @johnugeorge)
- Training Container for NAS RL Suggestion in v1alpha2 (#614 by @andreyvelich)
- Implementing v1alpha2 grid search suggestion algorithm (#622 by @johnugeorge)
- feat: Support bayesianoptimization in v1alpha2 (#595 by @gaocegege)
- NAS RL Suggestion for v1alpha2 (#613 by @andreyvelich)
- Fix problems in the UI for v1alpha2 (#623 by @andreyvelich)
- Updated help message for golint. (#621 by @gyliu513)
- Fix Scheme in Katib Client for v1alpha2 (#620 by @andreyvelich)
- Set trial completion status only after metric collection (#616 by @johnugeorge)
- go unit tests from presubmits (#618 by @johnugeorge)
- Skip creating trials if add count is zero (#617 by @johnugeorge)
- Fix nasrl example in v1alpha2 (#609 by @andreyvelich)
- Enabled make check in travis. (#608 by @gyliu513)
- fix make check (#606 by @gyliu513)
- Fine-grained docker image build. (#605 by @gyliu513)
- Restructuring manifests (#602 by @johnugeorge)
- Fixing latest tag (#603 by @johnugeorge)
- Minor changes to metric collector manifest (#601 by @johnugeorge)
- Mini fix for v1alpha1 metricsCollector (#600 by @hougangliu)
- Check error in OpenSQLConnection (#588 by @andreyvelich)
- Fix issue of hyperband suggestion service cannot move on (#596 by @hougangliu)
- doc: Update readme (#593 by @gaocegege)
- Reverse logic of Less in hyperband v1alpha1 (#592 by @hougangliu)
- Mini fix for getExperimentConf (#594 by @hougangliu)
- feat: Add UI in manifests v1alpha2 (#591 by @gaocegege)
- feat: Support flags in UI (#590 by @gaocegege)
- Default make target to v1alpha2. (#585 by @gyliu513)
- Change undeploy script (#587 by @andreyvelich)
- Added undeploy for katib. (#579 by @gyliu513)
- feat(trial): Add more failure test cases (#570 by @gaocegege)
- Add categories for katib CRDs (#576 by @hougangliu)
- Add Validate Algorithm Settings in v1alpha2 (#574 by @andreyvelich)
- Updated makefile by adding more targets for developer. (#575 by @gyliu513)
- feat(experiment): Add more test cases (#563 by @gaocegege)
- refactor: Use manager client to get log for test (#569 by @gaocegege)
- Adding go tools scripts - part 1 (#573 by @gyliu513)
- Retain for job and metricsCollector (#572 by @hougangliu)
- Fix finalizer cannot work (#571 by @hougangliu)
- Implement GetExperimentInDB (#558 by @hougangliu)
- refactor: Unify the interface (#568 by @gaocegege)
- Implement trial observation metrics (#564 by @johnugeorge)
- fix: Check if the deletion time is nil or zero (#567 by @gaocegege)
- feat(experiment-validator): Inject client (#561 by @gaocegege)
- Change path for yaml file and NAS training containers (#566 by @andreyvelich)
- Added metric name to GetObservationLogRequest (#559 by @johnugeorge)
- Reducing the Prow workflow name length (#562 by @johnugeorge)
- chore: Add code coverage test (#560 by @gaocegege)
- feat(experiment): Add test cases (#556 by @gaocegege)
- chore: Refactor nasenvelopenet (#492 by @gaocegege)
- feat(trial): Refactor trial controller and add basic test cases (#528 by @gaocegege)
- Add status for experiment and trial in additionalPrinterColumns (#555 by @hougangliu)
- Fix default metricsController wrong args (#550 by @hougangliu)
- Add NAS RL yaml example for v1alpha2 (#554 by @andreyvelich)
- Fix trial requestNumber error (#553 by @hougangliu)
- Adding test for random suggestion algorithm (#552 by @johnugeorge)
- Adding minor styles changes (#548 by @johnugeorge)
- Fix nil pointer error when create trial (#547 by @hougangliu)
- Used klog for katib - part 1. (#526 by @gyliu513)
- Implement GetSuggestions for general suggestion (#546 by @hougangliu)
- fix: Fix the conflicts in UI backend (#545 by @gaocegege)
- Earlystopping changes trigger CI based on version (#544 by @hougangliu)
- Adding manifests for manager rest (#542 by @johnugeorge)
- Katib UI for v1alpha2 (#486 by @andreyvelich)
- Enable suggestion-random image build and push in CI (#543 by @hougangliu)
- fix(status): Define status handler (#518 by @gaocegege)
- Include suggestion manager based on version in prow (#541 by @hougangliu)
- Adding random suggestion algorithm implementation and manifests (#540 by @johnugeorge)
- fix: Add test cases for validator and manifest generator (#508 by @gaocegege)
- Update trial status DB operation (#537 by @hougangliu)
- [v1alpha2] Add labels for TFjob and PytorchJob in Metrics Collector (#539 by @richardsliu)
- v1alpha2 TFEvent metrics collector (#538 by @richardsliu)
- Register Trial in DB (#530 by @andreyvelich)
- Restructuring docker files to build images per version (#527 by @johnugeorge)
- Dep ensure to sync up vendor. (#535 by @gyliu513)
- fix: Avoid side effect (#532 by @gaocegege)
- Include vendor dir instead of Gopkg in prow config (#536 by @hougangliu)
- Update experiment status DB operation (#534 by @hougangliu)
- feat(api): Add total number of trials (#501 by @gaocegege)
- Fix wrong error-handling logic in db interface (#529 by @hougangliu)
- chore: Remove dep ensure in CI (#525 by @gaocegege)
- Delete experiment in DB if in need (#519 by @hougangliu)
- Support for Custom Job resources (#512 by @johnugeorge)
- Fix ut test and enable ut-test of v1alpha2 (#524 by @hougangliu)
- godep: Remove useless dep (#521 by @gaocegege)
- Fix prow to trigger corresponding workflow (#520 by @hougangliu)
- create experiment in db (#509 by @hougangliu)
- refactor(suggestion): Use interface (#502 by @gaocegege)
- feat(CI): Run different flow according to version (#516 by @gaocegege)
- Added PR and Issue template. (#505 by @gyliu513)
- Enabled verbose logging for dev guide. (#504 by @gyliu513)
- v1alpha2 metrics collector - controller (#496 by @richardsliu)
- Update util for experiment in v1alpha2 (#485 by @andreyvelich)
- add common package (#491 by @hougangliu)
- Add metrics collector spec and objective spec to Trial (#489 by @richardsliu)
- Prune katib OWNERS file (#490 by @richardsliu)
- Training container for NAS Envelopenet (#429 by @garganubhav)
- NAS Envelopenet Suggestion and Job Example (#425 by @garganubhav)
- V1alpha2 Metrics collector (part 1) (#484 by @richardsliu)
- enable test for katib-manager (#478 by @hougangliu)
- Remove outdated TODOs in README.md (#468 by @terrytangyuan)
- Get experiment config from the instance (#474 by @andreyvelich)
- Fix KatibClient name in v1alpha2 (#483 by @andreyvelich)
- Add Katib Client in v1alpha2 (#480 by @andreyvelich)
- Add metrics collector spec to v1alpha2 API (#481 by @richardsliu)
- vizier-core does not need any role (#482 by @hougangliu)
- katib manager db error (#476 by @hougangliu)
- share one grpc-health-probe (#477 by @hougangliu)
- validation and mutating webhook for experiment (#473 by @hougangliu)
- enable test for v1alpha2 (#465 by @hougangliu)
- Add serviceAccountName in UI deployment (#469 by @andreyvelich)
- chore: Skip test when code is not changed (#467 by @gaocegege)
- Adding initial v1alpha2 API controller (#457 by @johnugeorge)
- v1alpha2 api server implementation (#456 by @YujiOshima)
- fix(readme): Merge image directory (#455 by @gaocegege)
- Update REAME example links for v1alpha1 (#452 by @alexandraj777)
- fix py client import error (#453 by @hougangliu)
- ClusterRoleBinding doesn't need namespace field (#451 by @hougangliu)
- Update API for NAS in v1alpha2 (#450 by @andreyvelich)
- Restructuring test scripts for v1alpha1 and v1alpha2 (#449 by @johnugeorge)
- Code restructuring to support V1alpha1 and V1alpha2 API (#448 by @johnugeorge)
- Fix labels matching the job operator implementation (#447 by @johnugeorge)
- Updating the pytorch example image (#446 by @johnugeorge)
- Remove redundant lock (#444 by @mrkm4ntr)
- add v1alpha2 grpc api (#427 by @YujiOshima)
- Remove katibcli (#436 by @jdplatt)
- Change datadir for avoid failure due to lost+found (#432 by @mrkm4ntr)
- fix demo link (#434 by @jq)
- Add fault tolerance support for trial failure (#424 by @DeeperMind)
- Test for Bayesian Optimization Algo (#406 by @jdplatt)
- Katib v1alpha2 API for CRDs (#381 by @richardsliu)
- Add NAS team as reviewers (#419 by @andreyvelich)
- Multiple Trials for Reinforcement Learning Suggestion (#416 by @DeeperMind)
- Fix the package version in training container (#418 by @DeeperMind)
- Add validation for NAS job in Katib controller (#398 by @andreyvelich)
- Fix path to API protobuf in developer guide (#415 by @andreyvelich)
- Add support for parallel studyjobs (#404 by @DeeperMind)
- Add separable/depthwise convolution, data augmentation and multiple GPU support (#393 by @DeeperMind)
- Add create time to Trial API (#410 by @andreyvelich)
- Metric collector must fail on error (#405 by @johnugeorge)
- add latest tag for katib images (#409 by @hougangliu)
- add build and test for suggestion nasrl (#401 by @hougangliu)
- Database APIs for NAS updated (#394 by @Akado2009)
- Suggestion for Neural Architecture Search with Reinforcement Learning (#339 by @DeeperMind)
- add validating webhook for studyJob (#383 by @hougangliu)
- Removing Operator specific handling during a StudyJob run (#387 by @johnugeorge)
- Delete modeldb from unit tests (#391 by @andreyvelich)
- show studyjob condition when run kubectl get (#389 by @hougangliu)
- Training Container with Model Constructor for cifar10 (#345 by @DeeperMind)
- add studyjob python client (#379 by @hougangliu)
- fix wrong example (#378 by @hougangliu)
- Upgrading controller runtime and k8s to 1.11.2 (#376 by @johnugeorge)
- Properly initialize CI cluster credential (#360 by @toshiiw)
- Include go dependencies in developer-guide.md (#369 by @alexandraj777)
- fix invalid memory address (#368 by @hougangliu)
- Fix presubmits (#363 by @richardsliu)
- Katib 2019 Roadmap (#348 by @richardsliu)
- Update OWNERS (#350 by @richardsliu)
- Extend Katib API for NAS jobs (#327 by @andreyvelich)
- ignore tfjob/pytorch job if corresponding CRD not created (#335 by @hougangliu)
- Clarify the example UI is generated by random-example. (#333 by @gyliu513)
- only try to delete study info in db when in need (#342 by @hougangliu)
- omit empty fields for studyjob status (#336 by @hougangliu)
- Update pytorch example with latest image (#329 by @TimZaman)
- Fix typo in json API (#330 by @richardsliu)
- Add information how to run TFjob and Pytorch examples in Katib (#321 by @andreyvelich)
- Add xgboost example using Bayesian optimization (#320 by @richardsliu)
- katib should be able to be deployed in any namespace (#324 by @hougangliu)
- Adding distributed pytorch example for katib (#309 by @johnugeorge)
- Minor fixes (#307 by @johnugeorge)
- delete obsolete data in db (#315 by @hougangliu)
- add bestTrialId to statusJob status (#312 by @hougangliu)
- Add api doc (#303 by @YujiOshima)
- validate studyJob when first reconcile it (#308 by @hougangliu)
- add hougangliu as a reviewer (#310 by @hougangliu)
- Adding to OWNERS file (#304 by @johnugeorge)
- sync up worker status all the time (#299 by @hougangliu)
- studyJob with non-kubeflow namespace cannot work (#302 by @hougangliu)
- Adding master pod check for default metric collector (#300 by @johnugeorge)
- reduce some redundant code (#296 by @hougangliu)
- Extend studyjob client API (#288 by @andreyvelich)
- Use same deploy.sh when deploy katib components (#284 by @ytetra)
- update Readme (#295 by @hougangliu)
- fix studyJob status suggestionCount mismatch error (#290 by @hougangliu)
- fix invalid worker kind issue (#287 by @hougangliu)
- get metricscollector by API (#292 by @YujiOshima)
- Support Pytorch job in Katib (#283 by @johnugeorge)
- Update k8s cluster version to 1.10 (#286 by @johnugeorge)
- Enrich GUI (#264 by @YujiOshima)
- update README (#281 by @hougangliu)
- fix typo error for MinikubeDemo (#282 by @hougangliu)
- fix typo error (#280 by @hougangliu)
- add e2eTest of each suggestion algorithm (#265 by @ytetra)
- Allow studyjobcontroller to delete pods (#278 by @richardsliu)
- Fix katib ui resource paths (#277 by @richardsliu)
- Implement gRPC Health Checking Protocol + add readiness/liveness probes to vizier-core (#270 by @lkpdn)
- POC: Katib integration with tf-operator (#267 by @richardsliu)
- fix timing to determine slice size in grid search (#271 by @ytetra)
- Add Update{Study,Trial} (#269 by @toshiiw)
- add Richard Liu to OWNERS (#274 by @YujiOshima)
- fix uncompleted value in ui (#238 by @YujiOshima)
- fix bayesian optimization suggestion (#251 by @YujiOshima)
- Prevent pod restarts caused by slow db boot (#261 by @lkpdn)
- add UT of each suggestion algorithm (#237 by @ytetra)
- Downgrade kubernetes dependency to 1.10.1 (#256 by @richardsliu)
- Fix incorrectly set namespace (#260 by @lkpdn)
- Set MYSQL_ROOT_PASSWORD via Secret (#253 by @lkpdn)
- update UI (#255 by @YujiOshima)
- Refactor studyjobcontroller (#254 by @richardsliu)
- Change deploy.sh for Minikube example (#252 by @andreyvelich)
- Add mysql based unit tests (#243 by @toshiiw)
- Update manifests (#246 by @YujiOshima)
- Add texasmichelle as reviewer (#247 by @texasmichelle)
- Tf event mc (#235 by @YujiOshima)
- Fix typos for json and objective (#242 by @toshiiw)
- Add richardsliu to OWNERS/reviewer (#239 by @richardsliu)
- add starttime and completiontime to worker (#236 by @wukong1992)
- Fix typo (#233 by @ytetra)
- More DB unit tests (#234 by @toshiiw)
- Fix the build script after #208 (#231 by @toshiiw)
- Only retry an INSERT operation on unique constraint violation (#229 by @toshiiw)
- New UI for Katib (#208 by @YujiOshima)
- fix slice range (#226 by @ytetra)
- More db tests (#225 by @toshiiw)
- Fix storelogs (#222 by @toshiiw)
- Check errors in order to avoid SEGV (#219 by @toshiiw)
- Fix reqest count (#214 by @YujiOshima)
0.2 (2018-08-20)
- pin mxnet/python image version (#139 by @mayankjuneja)
- Move the GKEDemo into kubeflow/examples (#135 by @jlewi)
- update OWNERS (#129 by @mitake)
- Hyperband (#124 by @YujiOshima)
- add releasing workflow (#113 by @YujiOshima)
- API: Add WorkerStatus to GetMetrics and remove unused items (#110 by @YujiOshima)
- Add e2e test (#114 by @YujiOshima)
- use kubectl port-forward in demos (#111 by @YujiOshima)
- docs: Generate CLI documentation (#105 by @gaocegege)
- changelog: Add (#104 by @gaocegege)
- Corrected typos in hyperband example yml (#146 by @shibuiwilliam)
- Update status of workers in GetWorkers (#127 by @YujiOshima)
- fix doc link and kubectl port-forward command (#120 by @YujiOshima)
- Fix typo (#123 by @mrkm4ntr)
- Fix indentation to use spaces (instead of a mix of tabs and spaces) (#121 by @vinaykakade)
- docs: Fix wrong command (#108 by @mrkm4ntr)
- Remove dlk from manifests (#107 by @vinaykakade)
v0.1.2-alpha (2018-06-05)
Closed issues:
- [request] Invite libbyandhelen as reviewer for algorithm support #82
- cli failed to connect #80
- CreateStudy RPC error: Objective_Value_Name is required #73
- [cli] Use cobra to refactor the cli #54
- Reduce time it takes to build all images #50
- [release] Ksonnet the katib #32
Merged pull requests:
- update docs #103 (YujiOshima)
- cli: Rename to katib-cli #101 (gaocegege)
- Fix dbbug #98 (YujiOshima)
- save logs after check status #95 (YujiOshima)
- Some fix to getting-started.md #94 (lluunn)
- Add katib-cli download command for Mac #93 (vinaykakade)
- fix get service-param-list bug #92 (YujiOshima)
- fix ui bug #91 (YujiOshima)
- Build parallel #85 (YujiOshima)
- reduce build time #84 (YujiOshima)
- OWNERS: Add @libbyandhelen #83 (gaocegege)
- add random forest prior to Bayesian Optimization #81 (libbyandhelen)
- workflows.libsonnet: Fix the image name #75 (gaocegege)
- Refine API #74 (YujiOshima)
- worker: Rename worker_interface to worker #70 (gaocegege)
v0.1.1-alpha (2018-04-24)
Closed issues:
- [upstream] Update name in kubernetes/test-infra #63
- [go] Update the package name, again #62
- [test] Fix broken unit test cases #58
- Provide a cli binary for macOS / darwin #57
- Error running katib on latest master (04/13) #44
- Upload existing models to modelDB interface #43
- [release] Add cli to v0.1.0-alpha #31
- [discussion] Find a new way to install CLI #26
- [maintainance] Setup the repository #8
- Existing approaches and design for hyperparameter-tuning #2
Merged pull requests:
- Cobra cli #69 (YujiOshima)
- *: Refactor the structure #65 (gaocegege)
- *: Update name #64 (gaocegege)
- Replace kubeflow-images-staging with kubeflow-images-public #61 (ankushagarwal)
- improve frontend #60 (YujiOshima)
- argo: Add unit test #56 (gaocegege)
- main.go: Fix style #55 (gaocegege)
- Fix modelsave #52 (YujiOshima)
- refactor Model API #51 (YujiOshima)
- improve test script #49 (YujiOshima)
- Add Model Management API #48 (YujiOshima)
- reviewers: Add @ddysher @jose5918 @mitake #45 (gaocegege)
- add early stoppping service #41 (YujiOshima)
- bayesian optimization draft #38 (libbyandhelen)
- Dockerfile: Use alpine as base image #37 (gaocegege)
- docs: Update katib-cli #36 (gaocegege)
- New db log schema #35 (YujiOshima)
- Fix CI failures #27 (gaocegege)
v0.1.0-alpha (2018-04-10)
Closed issues:
- [suggestion] Move the logic about random service to
random
package #18 - [build-release] Reuse the vendor during the image building process #14
- [go] Rename the package from mlkube/katib to this repo #7
- [go] Establish vendor dependencies for go #5
- Rename to hyperparameter-tuning ? #1
Merged pull requests:
- cleanup of README #30 (ddutta)
- delete unnecessary settings #29 (YujiOshima)
- Dockerfile: Support multiple stage build in dlk and frontend #28 (YujiOshima)
- Dockerfile: Support multiple stage build in manager and cli #25 (gaocegege)
- Dockerfile: Use multiple stage builds #23 (gaocegege)
- Ci setup #22 (YujiOshima)
- suggestion: Refactor #21 (gaocegege)
- update packages #19 (YujiOshima)
- README: Add code quality badge #17 (gaocegege)
- Fixing some basic typos in README #13 (ddutta)
- vendor: Add #12 (gaocegege)
- ignore: Add macOS, Windows and Go ignore files #11 (gaocegege)
- Rename packages and move dlk dir #10 (YujiOshima)
- doc: Refactor #9 (gaocegege)
- add katib code #4 (YujiOshima)
- add OWNERS file #3 (YujiOshima)