-
Notifications
You must be signed in to change notification settings - Fork 28
ADM Health Check
The ADM Health Check provides a generic overview of the ADM models in your system, including charts like the "Bubble Chart" and many more, with recommendations. It can be used as-is or be used as a basis for further, specific analyses. The ADM Model Reports give a detailed view of an individual ADM model, including the binning details of all the predictors which can help drive insights.
Technically, both are Quarto markdown documents using snippets of Python code from the PDS Tools libraries. We also have legacy R versions of both (using R markdown instead of Quarto).
Whatever option you choose, the ADM datamart needs to made available first. It consists of two tables: one for the models, one with the predictor details - that latter one is optional for the ADM overview. For instructions on how to export the data from Pega, see How to export the ADM Datamart.
There are a number of options to generate the Health Check reports:
Option | When to use | Instructions |
---|---|---|
Github Codespace | Run Health Check in the cloud, directly from GitHub without the need to install any tools | Instructions below |
Standalone Python app | Run a python-based application locally, need to install Python, PDS tools and supporting libraries, but no coding skills required | See instructions on the Health Check application |
VSCode or other IDE | Run the notebooks from a developer environment. Allows for customizations but requires coding skills. | Instructions below for the R versions |
Batch runs | Run the reports in batch - especially convenient when you need to create many individual model reports, or when doing this regularly. Supporting example batch scripts are available, requires some (light) scripting skills and an environment with Python, PDS tools and supporting libraries. | Creating Reports in Batch |
Run Health Check in the cloud, directly from GitHub without the need to install any tools. A Github codespace is a development environment that's hosted in the cloud. Each codespace you create is hosted by GitHub in a Docker container, running on a virtual machine.
- When you do not or cannot have Python or R installed locally on your computer.
- When you want to quickly run an ADM health check report with no need for local set up.
-
This analysis is not processed on your own device, you're running the code on GitHub.
-
You will load your ADM snapshot data into the GitHub codespace. This data will be limited to the model and predictor data from ADM. You can review the structure of the two datasets here.
-
You should review your corporate security policies and ensure that using GitHub cloud codespaces is permitted. Reference the GitHub codespace security policies.
-
Pega will not have visibility of any usage statistics/telemetry data/real data you load into this space.
-
Navigate to the main PDS Tools page on GitHub.
-
Click on the green Code button and choose to add a codespace (you'll need to sign in to GitHub, you can create a free account). It will now automatically set up the code space, run a browser-based version of VSCode, load PDS Tools and start the Health Check application. This will take a few minutes.
-
Ensure that pop ups are allowed, as the Health Check application will open in a new browser tab
-
Before accessing the app, first upload your ADM Datamart files to your GitHub workspace. You can accomplish this by right-clicking within the Explorer section (located on the left-hand side of the interface) and selecting 'Upload'. After the files are uploaded, navigate to the 'Import File' section of the app. Ensure you select the 'Direct file path' option and input the path '/workspaces/pega-datascientist-tools'.
-
Follow the instructions on the Health Check application
The stand-alone health check application makes it easy to create the ADM Health Check and the individual model reports. You will need to have python and install pdstools, but you do not need to run a (data science) environment, and there is no need to create a script, it is all configured from a UI.
-
Install Python and PDS tools with app dependencies. If you already had an older version of PDS tools make sure to upgrade to the latest.
pip install --upgrade 'pdstools[app]'
- Launch the Health Check application by running
pdstools run
- The app should open up in your system browser. On first run, you may get a promotional message from streamlit asking for your e-mail address - you can leave this empty if you want. If the app does not open up automatically, simply copy the Local URL from your terminal and paste it into your browser.
-
In the app, navigate to the Health Check tab (in the left pane). This shows instructions.
-
Then click the "Data Import" tab in the main screen to load your data. If you haven't downloaded the ADM Datamart yet, this is the moment to do so. For instructions on how to export the data from Pega, see How to export the ADM Datamart. For a test drive, you can also skip uploading your own data, and select "CDH Sample" in the Data Import drop down.
- Use Direct file path with the folder path where the ADM files are located. Ex. /User/Downloads/. The tool should automatically find the relevant files in that directory. Note: there is no need to extract the zip files, we will also take care of that for you.
- Use Direct file upload to browse your local files.
-
The "Report Configuration" has a few advanced options but can generally be left empty
-
Then Generate and Download the ADM Health Check report. The download button will appear when the generation is finished. The downloaded report will show in your default browser download locations.
- Install Quarto and Pandoc.
- Install R, R Studio and the pdstools package as per the "Getting started" in the main page
- Either check out ("clone") the Pega Data Scientist Tools repository from git, or (if you are not comfortable with git), just download the Model Overview notebook.
- From Pega, export the ADM datamart data (model data and optionally predictor data) as described in How to export the ADM datamart data.
- Open the notebook "examples/datamart/healthcheck.Rmd" in R Studio and select "Knit with Parameters" from the Knit drop-down.
- Enter the full path to your ADM datamart downloads in the fields for modelfile and predictordatafile. If you do not have predictor data, make that field blank.
- Press the "Knit" button
When finished (it takes a few minutes), the report should open in a browser window. It will also be saved automatically in your working folder. This stand-alone HTML (or PDF) file can then easily be distributed.
Knit option in R Studio | Knit dialog for the Health Check notebook |
---|---|
Follow similar steps to manually create individual model reports. Select "Knit with parameters" and fill in the paths to the ADM datamart download and the ID of the model you want to report on. The ID can be found in the side panel from the Prediction Studio UI, or by loading the model data in R or Python and inspecting it there. The title and description are used in the generated file for informational purposes only.
Knit option in R Studio | Knit dialog for the stand alone Adaptive Model Report notebook |
---|---|