Releases: NVIDIA/spark-rapids-tools
Releases · NVIDIA/spark-rapids-tools
v23.12.0
Packages
- Maven Release: https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.12.0/
- PyPI Package: https://pypi.org/project/spark-rapids-user-tools/23.12.0/
Changes
User Tools
- Fix user qualification tool runtime error in
get_platform_name
for onprem platform (#684) - [FEA] User tool should pass
--platform
option/argument to Profiling tool (#679) - Fix incorrect processing of short flags for user tools cli (#677)
- Updating new CLI name from ascli to spark_rapids (#673)
- Bump pyarrow version (#664)
- Improve new CLI testing ensuring complete coverage of arguments cases (#652)
Core
- Qualification tool: Add more information for unsupported operators (#680)
- Sync Execs and Expressions from spark-rapids resources (#691)
- Support parsing of inprogress eventlogs (#686)
- Enable features via config that are off by default in the profiler AutoTuner (#668)
- Fix platform names as string constants and reduce redundancy in unit tests (#667)
- Unified platform handling and fetching of operator score files (#661)
- Qualification tool: Ignore some of the unsupported Execs from output (#665)
Miscellaneous
- add markdown link checker (#672)
v23.10.1
Packages
- Maven Release: https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.10.1/
- PyPI Package: https://pypi.org/project/spark-rapids-user-tools/23.10.1/
Changes
User Tools
- Updating tools docs to remove dead links and profiling docs to not require cluster/worker info (#651)
- Updating autotuner to generation recommendation always, even without cluster info (#650)
- Updating dataproc container cost to be multiplied by number of cores (#648)
- [BUG] Support autoscaling clusters for user qualification tool on Databricks platforms (#647)
- Support extra arguments in new user tools CLI (#646)
- Improve logs with user tools and jar version details (#642)
Core
- Profiling tool: Add support for driver log as input to generate unsupported operators report (#654)
- Updating tools docs to remove dead links and profiling docs to not require cluster/worker info (#651)
- Updating autotuner to generation recommendation always, even without cluster info (#650)
- Qualification tool: Enhance mapping of Execs to stages (#634)
v23.10.0
Packages
- Maven Release: https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.10.0/
- PyPI Package: https://pypi.org/project/spark-rapids-user-tools/23.10.0/
Changes
User Tools
- Fix system command processing during logging in user tools (#633)
- Fix spinner animation blocking user input in diagnostic tool (#631)
- Enable Dynamic 'Zone' Configuration for Dataproc User Tools (#629)
Core
v23.08.2
Packages
- Maven Release: https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.08.2/
- PyPI Package: https://pypi.org/project/spark-rapids-user-tools/23.08.2/
Changes
User Tools
- Add unit tests for Dataproc GKE with mock GKE cluster (#618)
- Add support in user tools for running qualification on Dataproc GKE (#612)
- [BUG] Update user tools to use latest Databricks CLI version 0.200+ (#614)
- Add argprocessor unit test for checking error messages for onprem with no eventlogs (#605)
- Updating docs for custom speedup factors for scale factor (#604)
- [FEA] Add qualification user tool options to support external pricing (#595)
- [DOC] Add documentation for qualification user tool pricing discount options (#596)
- [FEA] Add user qualification tool options for specifying pricing discounts for CPU or GPU cluster, or both (#583)
- Add diagnostic capabilities for Databricks (AWS/Azure) environments (#533)
- Add verbose option to the CLI (#550)
- [FEA] Remove URLs from pydantic error messages (#560)
- Rename and change pyrapids to spark_rapids_tools (#570)
- Fix sdk_monitor exception thrown by abfs protocol (#569)
Core
- Generating speedup factors for Dataproc GKE L4 GPU instances (#617)
- Qualification tool: Add penalty for row conversions (#471)
- Add support in core tools for running qualification on Dataproc GKE (#613)
- Sync up remaining updated execs and exprs from rapids-plugin (#602)
- Adding speedup factors for Dataproc Serverless and docs fix (#603)
- Add xxhash64 function as supported in qualification tools (#597)
- Fix ProjectExecParser to include digits in expression names (#592)
- [FEA] Add json_tuple function as supported in qualification tool (#589)
- [FEA] Add flatten function as supported in qualification tool (#587)
- [FEA] Sync up conv function with rapids-plugin resources (#573)
Miscellaneous
v23.08.1
Packages
- Maven Release: https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.08.1/
- PyPI Package: https://pypi.org/project/spark-rapids-user-tools/23.08.1/
Changes
User Tools
- [DOC] Fix help command in documentation (#540)
- Implement a cross-CSP storage driver (#485)
- Build tools package as single artifact for restricted environments (#516)
Core
- Remove memoryOverhead recommendations for Standalone Spark (#557)
- [FEA] Add support to TIMESTAMP functions (#549)
- Fix handling of current_database and ArrayBuffer (#556)
- Add
translate
as supported expression in qualification tools (#546) - Adding TakeOrderedAndProject and BroadcastNestedLoopJoin, removing Project from speedup generation (#548)
- Qualification should treat promote_precision as supported (#545)
- Improve tool error message for files with text extensions (#544)
- Improve parsing of aggregate expressions (#535)
- Bump default build to use Spark-333 (#537)
- Improve AutoTuner plugin recommendation for Fat mode (#543)
- Updating speedup generation for more execs from NDS + validation script (#530)
- [FEA] Reset speedup factors for qualification tool in EMR 6.12 environments (#529)
- Add min, median and max columns to AccumProfileResults (#522)
- [FEA] Reset speedup factors for qualification tool in Databricks 12.2 environments (#524)
- Filter parser should check ignored-functions (#520)
- Update speedup factors for qualification tool in Dataproc 2.1 environments (#509)
Miscellaneous
v23.08.0
Packages
- Maven Release: https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.08.0/
- PyPI Package: https://pypi.org/project/spark-rapids-user-tools/23.08.0/
Changes
User Tools
- Support offline execution of user tools in restricted environments (#497)
- Handle deprecation errors in python packaging (#513)
- Adds profiling support for EMR in user tools. (#500)
Core
- Fix unit-tests for Spark-340 and Add spark-versions to gh-workflow (#503)
Miscellaneous
- fix gh-workflow for Python unit-tests (#505)
- Refactoring the speedup factor generation to support WholeStageCodegen parsing and environment defaults (#493)
- Try fix push issue in release action [skip ci] (#495)
- Revert "Push to protected branch using third-party action (#492)" (#494)
- Push to protected branch using third-party action (#492)
- Add secrets in the release.yml (#491)
- Add sign-off and token in release workflow (#490)
v23.06.4
Packages
- Maven Release: https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.06.4/
- PyPI Package: https://pypi.org/project/spark-rapids-user-tools/23.06.4/
Changes
User Tools
- Creating custom speedup factors README with generation script (#488)
- Bump dev-version to 23.06.4 (#468)
Core
- [FEA] Enhance qualification tool to handle custom speedup factor file as input (#475)
- Bump dev-version to 23.06.4 (#468)
Miscellaneous
v23.06.3
New minor release 23.06.3:
- mvn jar file: https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.06.3/
- pypi package: https://pypi.org/project/spark-rapids-user-tools/
Changes:
- Bump dev version to 23.06.3 (#442) (githubweb)
- Qualification command should run without cluster-info (#444) (githubweb)
- Fix mvn warning message for formatting string (#446) ([githubweb)
- Add Recommendations in AutoTuner for Databricks platform (#440) (githubweb)
- Update the minimum spark version requirements to run tools (#452) (githubweb)
- Generate speedup factors for A10 instance types in EMR (#450) (githubweb)
- Qualification tool: Add ExistenceJoin to supported jointype (#432) (githubweb)
- Bump certifi from 2022.12.7 to 2023.7.22 (#460) (githubweb)
- Add Progress Bar in Profiling tools (#456) (githubweb)
- [FEA] Include bootstrap recommended configs in qualification output (#451) (githubweb)
- Status Reporting for Event Logs and Applications in Qualification Tool (#464) (githubweb)
- Support wildcards in eventlogs arguments (#465) (githubweb)
- Refactor arguments to handle options after event logs (#466) (githubweb)
- Update version by jenkins-spark-rapids-tools-auto-release-30 (githubweb)
v23.06.2
New minor release 23.06.2:
- mvn jar file: https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.06.2/
- pypi package: https://pypi.org/project/spark-rapids-user-tools/
Changes:
- Qualification tool to report with granularity on unsupported operators (#401) (githubweb)
- Bug fix for supported expressions (#408) (githubweb)
- [DOC] Fix documentation and consistency in arguments for the EMR user-tools (#413) (githubweb)
- Remove serverless execution mode from the EMR-wrapper (#421) (githubweb)
- Remove cluster-mode execution wrapper (#424) (githubweb)
- [TASK] Implement classes and methods to support user tools profiling command for Databricks Azure (#423) (githubweb)
- Add platform argument in profiling tool for custom tuning based on the platform (#414) (githubweb)
- [DOC] Add documentation for running user tools profiling command for Databricks Azure (#430) (githubweb)
- update diagnostic unit-tests to support var directory (#429) (githubweb)
- Fix broken tables and code blocks in profiling-tool docs (#435) (githubweb)
- Support onprem platform for user-tools profiling command (#431) (githubweb)
- Add ascli as a new simplified tools command (#426) (details
- update more details for databricks azure user qualification tool doc (#438) (githubweb)
- Bump build version to 23.06.2 (#439) (details
v23.02.0 rapids-4-spark-tools jar release
Release of Qualification and Profiling tools jar.
You can find the download URL here: https://repo.maven.apache.org/maven2/com/nvidia/rapids-4-spark-tools_2.12/23.02.0/