2 Overview

This chapter provides an overview of the architecture of the Pmetrics package, including its main software engines, control functions, and other functions for data manipulation, model selection, diagnostics, and plotting.

2.1 R6 architecture

As of v. 2.0, Pmetrics uses an “R6” architecture less dependent on reading and writing files, preserving hard drive space and speeding execution, since file I/O is slow. The goal is to simplfy work within a session by eliminating the need to repeatedly copy and paste files from one folder to the next whenever a new fit is executed. Storing critical objects in memory means they can be used without accessing files.

However, data and model files are still used for longer term storage and preserving work from one session to another. The format of those files, both data and model, will be familiar to long term Pmetrics users.

Data files are generally .csv and the format is detailed in the chapter on data. A model can be read from a text file or can be defined directly in R. The easiest way to accomplish this is with our model builder app. Whether choosing to define models in R, with the builder app, or in a .txt file, details can be found in models.

Note

The R6 architecture is object-oriented, meaning that data and functions are bundled together into objects. Objects have fields which contain data, and methods which are functions that operate on the data. Both are accessed with the $ operator. This is different from the previous functional programming approach (called “S3” in R) used in Legacy Pmetrics, where data and functions were separate. The R6 architecture allows for better organization of code, easier maintenance, and improved performance.

For example, a PM_data object has fields such as $data and $standard_data, which contain the original and standardized data, respectively. It also has methods such as $summary() and $plot().

NOTE: Inclusion of the () after the method name is what distinguishes a method from a field. The parentheses are always required when calling a method, even if there are no arguments.

2.2 Rust as the backend

As of v. 3.0, Pmetrics uses Rust for the core of the package and no longer uses Fortran. This is for several reasons, including the fact that Rust is a modern programming language that is more efficient and easier to maintain than Fortran. Rust is memory safe, which means that it prevents common programming errors that can lead to memory leaks and crashes. This makes it a more reliable choice for high-performance computing tasks. Additionally, Rust has a growing ecosystem of libraries and tools that make it easier to work with data and perform complex computations and a more flexible choice for a wide range of applications.

2.3 Software engines

There are three main software engines that Pmetrics controls currently.

NPAG is the Non-parametric Adaptive Grid software. It will create a non-parametric population model consisting of discrete support points, each with a set of estimates for all parameters in the model plus an associated probability (weight) of that set of estimates. There can be at most one point for each subject in the study population. There is no need for any assumption about the underlying distribution of model parameter values.
NPOD is the Non-parametric Optimal Design software. Like NPAG, it will create a non-parametric population model consisting of discrete support points, each with a set of estimates for all parameters in the model plus an associated probability (weight) of that set of estimates. While NPAG searches parameter hyperspace randomly, NPOD uses likelihood gradients to search systematically. This usually results in faster convergence, but currently, NPOD is more likely to get stuck in a “local” minimum of the likelihood surface, i.e. it has a higher chance of not finding the globally optimal support points. NPOD is under active development.
The Simulator is a semi-parametric Monte Carlo simulation software program that can use the output of NPAG of NPOD to build randomly generated response profiles (e.g. time-concentration curves) for a given population model, parameter estimates, and data input. Simulation from a non-parametric joint density model, i.e. NPAG output, is possible, with each point serving as the mean of a multivariate normal distribution, weighted according to the weight of the point. The covariance matrix of the entire set of support points is divided equally among the points for the purposes of simulation.

Let’s look at how these engines are used to fit models to data and simulate.

2.4 Model functions

2.4.1 Model fitting

flowchart LR

subgraph RSession[" "]
  direction TB
  %% DATA["PM_data"]
  mid(("edit"))
  MODEL["PM_model"]
  RESULT["PM_result"]

  
end

DISK[("Hard Drive")]

MODEL -.-> mid(("edit")) -.-> MODEL
MODEL -- "$fit(PM_data, ...)" --> RESULT
%% RESULT -- "edit" --> MODEL
DISK -- "PM_load()" --> RESULT
RESULT -- "automatic" --> DISK


classDef blue fill:#2f6db6,stroke:#184a8b,color:#fff;
classDef orange fill:#c7662b,stroke:#8a3f18,color:#fff;
classDef disk fill:#d2d3d7,stroke:#7f8084,color:#000;
classDef ghost fill:transparent,stroke:transparent,stroke-width:0px,padding:0px,font-style:italic, color:#555;

class mid ghost;
class DATA,MODEL blue;
class RESULT orange;
class DISK disk;

linkStyle 4 font-style:italic, color:#555

style RSession fill:#e9f0ff,stroke:#9ab0d6,stroke-width:1px,rx:2,ry:2

Pmetrics uses PM_data to create data objects and PM_model to create model objects.
When a model is created, it is compiled and ready to be combined with the data and parameters (...) within the model’s $fit() method.
The $fit() method uses NPAG/NPOD to create a PM_result, which contains the probability distributions for primary model parameter values as well as many methods to summarize, examine, and plot the results.
Edit models to try and improve the fit as needed. This step occupies most of your time when modeling.

PM_data, PM_model, and PM_result objects and their methods, including $fit(), are extensively documented within R and in subsequent chapters here.

Tip

Help for Pmetrics functions and package data (and for any other package) are available in R by using the ?command syntax in your R console, as illustrated below.

# get help for PM_model
?PM_model

The $fit() method in PM_model objects replaces these Legacy functions: ITrun, ERRrun, NPrun.

2.4.2 Simulation

As for all Pmetrics engines, the simulator requires a model and data. The data serve as the template for the simulation, i.e., the dosing and sampling times, covariates, etc. The model defines the structural and error models to be used. Additionally, the simulator requires a population prior joint probability distribution for the parameter values from which it samples. The population parameter values and their distributions can come from either a previous model fit or be defined directly. More details can be found in the simulation chapter.

flowchart LR

subgraph RSession[" "]
  direction TB
  NEW["Prior<br>PM_data<br>PM_model"]
  RESULT["PM_result"]
  SIMULATION["PM_sim"]

end

DISK[("Hard Drive")]


RESULT -- "$sim(<i>PM_model, PM_data</i>, ...)" --> SIMULATION
NEW -- "PM_sim$new(<b>prior, PM_model, PM_data,</b> ...)" --> SIMULATION
SIMULATION -- "$save()" --> DISK
DISK -- "PM_sim$new()" --> SIMULATION


classDef blue fill:#2f6db6,stroke:#184a8b,color:#fff;
classDef orange fill:#c7662b,stroke:#8a3f18,color:#fff;
classDef green fill:#3ca34d,stroke:#276b33,color:#fff;
classDef disk fill:#d2d3d7,stroke:#7f8084,color:#000;

class NEW blue;
class RESULT orange;
class SIMULATION green;
class DISK disk;

style RSession fill:#e9f0ff,stroke:#9ab0d6,stroke-width:1px,rx:2,ry:2

As shown in the diagram above, there are two ways to invoke the simulator, both of which replace the Legacy SIMrun function.

Use the $sim() method attached to PM_result objects. In this case, the model, data, and prior parameter value probability distributions are all obtained from the previous fit contained within the PM_result object. You can optionally override the PM_result model or data by providing new values as arguments to the $sim() method. The model parameters must match the parameters in the original fit, and the data must contain values for any covariates used in the model.
Use PM_sim$new() for models, parameter value probability distributions and template data not derived from a previous fit, e.g. when lifted from an article. Creating simulations this way allows total freedom with respect to the model, the prior, and the data template, but all three inputs (prior, model, data template) are required.

Whether called by $sim() on a PM_result object, or using PM_sim$new(), the simulator will execute directly within R and return a PM_sim object. There is no longer any need to run the Legacy function SIMparse.

2.4.3 Saving

The Pmetrics R6 objects PM_data, PM_result, PM_sim, PM_valid, and PM_pta all have a $save() method. This method saves the object to the hard drive in the current working directory by default. The format is .rds which is a binary format used by R to save individual objects. The purpose of the $save() method is to enable retrieval of the object at a later time. As of v. 3.0, PM_model objects do not have a $save() method, as they should be defined directly in your R script. See models for details.

2.4.4 Loading

After a successful model fit, PM_load() creates a PM_result object rather than loading run results into the current environment and suffixed with the run number as for Legacy Pmetrics, e.g., op.1, final.1, etc. are no longer are created.

To load previously saved .rds ojbects, for PM_sim, PM_valid, and PM_pta, use the $new() method and provide the full or relative path (if not in the working directory) and name of the .rds file created by the corresponding $save() method.

2.4.5 Report generation

The PM_report function is automatically run at the end of a successful fit, and it will generate an HTML page with summaries of the fit, as well as the .Rdata files and other objects. The default browser will be automatically launched for viewing of the HTML report page. You can also regenerate the report at any time by calling the $report() method on a PM_result object.

# example, will not run
run1 <- PM_load(1)
run1$report() # regenerates the report

2.5 Other functions

Within Pmetrics there are also functions to manipulate data.csv files and process and plot extracted data, which we will cover in subsequent chapters.

2.5.1 Data manipulation

Comparison between the current and the Legacy methods are shown for education.

Function	R6	Legacy
Read data file	PM_data$new()	PMreadMatrix()
Check data file	Embedded in PM_data$new()	PMcheck()
Write data file	PM_data$save()	PMwriteMatrix()
Convert calendar dates and clock times	Embedded in PM_data$new()	PMmatrixReltime()
Convert from old USC\*PACK .wrk format	PMwrk2csv()	PMwrk2csv()
Convert from NONMEM	NM2PM()	NM2PM()

2.5.2 Model selection and diagnostics

Comparison between the current and the Legacy methods are shown for education.

Function	R6	Legacy
Compare models	PM_compare(PM_result1, PM_result2,…)	PMcompare(1, 2, …)
Plot residuals	PM_result$op$plot(resid = T,…)	plot(op, resid = T)
Construct VPC, pcVPC, NPDE	PM_result$validate() or PM_valid$new()	makeValid()
Plot VPC, pcVPC, NPDE	PM_valid$plot()	plot(PMvalid)
Stepwise covariate regression	PM_result$step() or PM_result$cov$step()	PMstep()

2.5.3 Other functions

Comparison between the current and the Legacy methods are shown for education.

Function	R6	Legacy
Calculate AUC	$auc() method for PM_result$op/$post/$pop, or PM_sim	makeAUC()
Simulate	PM_result$sim() or PM_sim$new()	SIMrun()
Probability of target attainment	PM_sim$pta() or PM_pta$new()	makePTA()

Again, all functions have extensive help files and examples which can be examined in R by using the help(command) or ?command syntax.