1 Background

The MetaboDynamics vignette explains the package workflow and interpretation of results with a SummarizedExperiment object as input. In this vignette I will show the usage of MetaboDynamics with a data frame as input.

2 Setup: load required packages

library(MetaboDynamics)
library(SummarizedExperiment) # storing and manipulating simulated metabolomics data
library(ggplot2) # visualization
library(dplyr) # data handling
library(tidyr) # data handling

3 Load data

The MetaboDynamics package also includes a simulated example data set in a data frame format. To load the data set in data frame format, execute the following code:

data("longitudinalMetabolomics_df", package = "MetaboDynamics")

4 Model dynamics

To keep run time of this vignette short we will execute the workflow on a subset of five metabolites across all conditions.

data("longitudinalMetabolomics_df")
# take a sample (of five metabolites and subset data
samples <- c(
  "UMP", "PEP",
  "2-Aminomuconate","ATP"
)
data <- longitudinalMetabolomics_df %>% filter(metabolite %in% samples)

head(data)

## # A tibble: 6 × 8
## # Groups:   metabolite, condition [1]
##   metabolite condition  time replicate measurement log_m m_scaled KEGG  
##   <chr>      <chr>     <int>     <int>       <dbl> <dbl>    <dbl> <chr> 
## 1 PEP        A             1         1     459223.  5.66    1.66  C00074
## 2 PEP        A             1         2      63932.  4.81   -0.207 C00074
## 3 PEP        A             1         3      34766.  4.54   -0.783 C00074
## 4 PEP        A             2         1       9522.  3.98   -2.01  C00074
## 5 PEP        A             2         2      70377.  4.85   -0.116 C00074
## 6 PEP        A             2         3      46996.  4.67   -0.498 C00074

The measured metabolites are stored in column “metabolite”, time points are specified in column “time”, and column “condition” specifies the experimental condition.

The required log-transformed and scaled (per metabolite and condition to a mean of 0 and sd of 1) metabolite abundances are stored in column “m_scaled”. The function fit_dynamics_model() allows to specify other column names for metabolite, time points, condition and scaled measurement.

First we will fit the dynamics model and extract the diagnostics:

# fit dynamics model
fit <- #  when using a data frame as input fits have to stored in a separate object
  fit_dynamics_model(
    model = "scaled_log",
    data = data,
    scaled_measurement = "m_scaled", # in which column are scaled measurments stored?
    max_treedepth = 10,
    adapt_delta = 0.99, # default 0.95
    iter = 2000,
    warmup = 2000 / 4, # default is 1/4 of iterations
    chains = 1, # only set to 1 in vignette, recommended default is 4!
    cores = 1 # only set to 1 in vignette, can be same as chains if machine allows for parallelization
  )

## Are your metabolite concentrations normalized and standardized?
##           We recommend normalization by log-transformation.
##           Scaling and centering (mean=0, sd=1) should be metabolite and condition specific.

## 
## SAMPLING FOR MODEL 'm_ANOVA_partial_pooling_euclidean_distance' NOW (CHAIN 1).
## Chain 1: 
## Chain 1: Gradient evaluation took 5.7e-05 seconds
## Chain 1: 1000 transitions using 10 leapfrog steps per transition would take 0.57 seconds.
## Chain 1: Adjust your expectations accordingly!
## Chain 1: 
## Chain 1: 
## Chain 1: Iteration:    1 / 2000 [  0%]  (Warmup)
## Chain 1: Iteration:  200 / 2000 [ 10%]  (Warmup)
## Chain 1: Iteration:  400 / 2000 [ 20%]  (Warmup)
## Chain 1: Iteration:  501 / 2000 [ 25%]  (Sampling)
## Chain 1: Iteration:  700 / 2000 [ 35%]  (Sampling)
## Chain 1: Iteration:  900 / 2000 [ 45%]  (Sampling)
## Chain 1: Iteration: 1100 / 2000 [ 55%]  (Sampling)
## Chain 1: Iteration: 1300 / 2000 [ 65%]  (Sampling)
## Chain 1: Iteration: 1500 / 2000 [ 75%]  (Sampling)
## Chain 1: Iteration: 1700 / 2000 [ 85%]  (Sampling)
## Chain 1: Iteration: 1900 / 2000 [ 95%]  (Sampling)
## Chain 1: Iteration: 2000 / 2000 [100%]  (Sampling)
## Chain 1: 
## Chain 1:  Elapsed Time: 1.228 seconds (Warm-up)
## Chain 1:                2.925 seconds (Sampling)
## Chain 1:                4.153 seconds (Total)
## Chain 1:

This returns a model fit.

# Extract diagnostics
diagnostics <- # when using a data frame as input diagnostics have to be stored in a separate object
  diagnostics_dynamics(
    data = data, # data frame that was used to fit the dynamics model,
    fit = fit, # list of fits from dynamics model, result of fit_dynamics_mode function
    iter = 2000, # how many iterations were used to fit the dynamics model
    chains = 1, # how many chains were used to fit the dynamics model
  )

That returns a list with elements [[“model_diagnostics”]] which holds a summary of the model diagnostics and the extracted posterior estimates. [["posterior]] holds the posterior predictions needed for the posterior predictive check (PPC)

To visualize the diagnostics and the PPC the following code can be used:

plot_diagnostics(
  data = data, # data frame used to fit the dynamics model
  diagnostics = diagnostics[["model_diagnostics"]] # summary of diagnostics
)

## $divergences

## 
## $max_treedepth

## 
## $Rhat

## 
## $n_eff

plot_PPC(
  data = data, posterior = diagnostics[["posterior"]],
  scaled_measurement = "m_scaled"
)

## Warning: Removed 2088 rows containing non-finite outside the scale range
## (`stat_ydensity()`).

In this case the diagnostics are all indicating a sane model fit: The model fitting did not result in divergent transitions, the maximum tree depth was not exceeded, Rhat values were below 1.01 and number of effective samples exceeded 100.

5 Extract estimates and visualize results

To extract the estimates one has to specify again the data, list of fits and number of iterations and chains used to fit the model. Additionally is is possible to specify the columns in which metabolite names, experimental conditions and time points are stored. This is equivalent to the fit_dynamics_model function.

Additionally one can specify how many samples should be drawn from the posterior to for example test clustering performance.

estimates <- # estimates have to be stored in a separate object when using data frames
  estimates_dynamics(
    data = data,
    fit = fit)

This returns a list containing the estimates for every experimental condition. The estimates can be visualized in two ways by the following code:

# only visualize differences between time points
plot_estimates(
  # does not need data as input
  estimates = estimates,
  delta_t = TRUE, # choose to visualize differences between time points
  dynamics = FALSE,
  distance_conditions = FALSE
)

## $delta_t
## $delta_t$`A_2-1`

## 
## $delta_t$`A_3-1`

## 
## $delta_t$`A_4-1`

## 
## $delta_t$`A_4-3`

## 
## $delta_t$`A_4-2`

## 
## $delta_t$`A_3-2`

## 
## $delta_t$`B_4-2`

## 
## $delta_t$`B_4-1`

## 
## $delta_t$`B_3-2`

## 
## $delta_t$`B_4-3`

## 
## $delta_t$`B_3-1`

## 
## $delta_t$`B_2-1`

# only visualize dynamics
plot_estimates(
  estimates = estimates,
  delta_t = FALSE,
  distance_conditions = FALSE,
  dynamics = TRUE
) # choose to visualize the dynamics

## $dynamcis

# only visualize euclidean distance between dynamics vectors
# only visualize dynamics
plot_estimates(
  estimates = estimates,
  delta_t = FALSE,
  distance_conditions = TRUE,
  dynamics = FALSE
) # choose to visualize the dynamics

## $distance_conditions
## $distance_conditions$A_B

## `height` was translated to `width`.

6 Clustering dynamics

To cluster the dynamics we need either the estimates from the dynamics model (stored in object “estimates”) or a data frame in which the means of measurements are stored in a column named “mu_mean”, categorical time points are stored in ascending order in column “time.ID” and experimental conditions in a column named “condition”.

The following chunk of code shows the needed data frame format for the clustering function:

head(estimates)

## $mu
##                metabolite time condition parameter        mean        2.5%
## mu[1,1,1]             PEP    1         A        mu  0.97630044 -0.14048052
## mu[1,1,2]             PEP    1         B        mu  0.69739944 -0.97859542
## mu[1,2,1]             PEP    2         A        mu -0.47456080 -1.16948821
## mu[1,2,2]             PEP    2         B        mu -0.11637819 -1.29954249
## mu[1,3,1]             PEP    3         A        mu -0.36826085 -1.56606802
## mu[1,3,2]             PEP    3         B        mu -0.16974815 -1.01328022
## mu[1,4,1]             PEP    4         A        mu -0.13619897 -1.86096215
## mu[1,4,2]             PEP    4         B        mu -0.44338924 -2.29375652
## mu[2,1,1]             UMP    1         A        mu  0.23888047 -0.94207142
## mu[2,1,2]             UMP    1         B        mu  0.27423732 -0.65602576
## mu[2,2,1]             UMP    2         A        mu -0.23943816 -1.07732788
## mu[2,2,2]             UMP    2         B        mu  0.85668085 -0.45415622
## mu[2,3,1]             UMP    3         A        mu  0.55274684 -0.43374051
## mu[2,3,2]             UMP    3         B        mu -0.10258653 -0.96653740
## mu[2,4,1]             UMP    4         A        mu -0.42342611 -2.42165451
## mu[2,4,2]             UMP    4         B        mu -0.97173501 -2.28690000
## mu[3,1,1]             ATP    1         A        mu  0.18630283 -1.59008344
## mu[3,1,2]             ATP    1         B        mu  0.04092512 -1.14616884
## mu[3,2,1]             ATP    2         A        mu -0.76707559 -2.23005805
## mu[3,2,2]             ATP    2         B        mu -0.40010536 -2.16463618
## mu[3,3,1]             ATP    3         A        mu  0.15213998 -1.52140333
## mu[3,3,2]             ATP    3         B        mu  0.04825043 -1.98290586
## mu[3,4,1]             ATP    4         A        mu  0.42739300 -0.57438690
## mu[3,4,2]             ATP    4         B        mu  0.38587685 -0.57518009
## mu[4,1,1] 2-Aminomuconate    1         A        mu  0.58111696 -1.33838311
## mu[4,1,2] 2-Aminomuconate    1         B        mu -0.75914921 -1.47195096
## mu[4,2,1] 2-Aminomuconate    2         A        mu -0.87366077 -1.87828808
## mu[4,2,2] 2-Aminomuconate    2         B        mu  0.71064303 -1.31762848
## mu[4,3,1] 2-Aminomuconate    3         A        mu  0.32963698 -0.02706729
## mu[4,3,2] 2-Aminomuconate    3         B        mu -0.06014678 -0.45888114
## mu[4,4,1] 2-Aminomuconate    4         A        mu -0.10406272 -1.40856559
## mu[4,4,2] 2-Aminomuconate    4         B        mu -0.06417609 -0.70788409
##               97.5%
## mu[1,1,1] 1.9550010
## mu[1,1,2] 2.2602577
## mu[1,2,1] 0.2440841
## mu[1,2,2] 1.0162598
## mu[1,3,1] 0.6755357
## mu[1,3,2] 0.6389410
## mu[1,4,1] 1.7640306
## mu[1,4,2] 1.2284942
## mu[2,1,1] 1.4454462
## mu[2,1,2] 1.1330341
## mu[2,2,1] 0.6104708
## mu[2,2,2] 2.1178131
## mu[2,3,1] 1.6568232
## mu[2,3,2] 0.7540832
## mu[2,4,1] 1.5303000
## mu[2,4,2] 0.7454387
## mu[3,1,1] 1.9625621
## mu[3,1,2] 1.1406606
## mu[3,2,1] 1.0104528
## mu[3,2,2] 1.6084111
## mu[3,3,1] 1.7054187
## mu[3,3,2] 1.7084651
## mu[3,4,1] 1.1533385
## mu[3,4,2] 1.4433042
## mu[4,1,1] 2.3104495
## mu[4,1,2] 0.3012983
## mu[4,2,1] 0.3299285
## mu[4,2,2] 2.4142256
## mu[4,3,1] 0.6550311
## mu[4,3,2] 0.3621274
## mu[4,4,1] 1.1796653
## mu[4,4,2] 0.5448305
## 
## $sigma
##                   metabolite time condition parameter      mean       2.5%
## sigma[1,1,1]             PEP    1         A     sigma 0.7963643 0.28515749
## sigma[1,1,2]             PEP    1         B     sigma 1.5673009 0.63494076
## sigma[1,2,1]             PEP    2         A     sigma 0.5591411 0.18127065
## sigma[1,2,2]             PEP    2         B     sigma 0.9126059 0.32649104
## sigma[1,3,1]             PEP    3         A     sigma 0.7500079 0.23081456
## sigma[1,3,2]             PEP    3         B     sigma 0.5596638 0.16140724
## sigma[1,4,1]             PEP    4         A     sigma 1.7970005 0.80897609
## sigma[1,4,2]             PEP    4         B     sigma 1.7199636 0.77255658
## sigma[2,1,1]             UMP    1         A     sigma 1.0041201 0.37173599
## sigma[2,1,2]             UMP    1         B     sigma 0.6816466 0.23671234
## sigma[2,2,1]             UMP    2         A     sigma 0.6309947 0.20146370
## sigma[2,2,2]             UMP    2         B     sigma 1.1549888 0.44091953
## sigma[2,3,1]             UMP    3         A     sigma 0.7564592 0.25499170
## sigma[2,3,2]             UMP    3         B     sigma 0.5756320 0.16899540
## sigma[2,4,1]             UMP    4         A     sigma 2.0233512 0.93434702
## sigma[2,4,2]             UMP    4         B     sigma 1.3519363 0.54745165
## sigma[3,1,1]             ATP    1         A     sigma 1.6015118 0.65894914
## sigma[3,1,2]             ATP    1         B     sigma 0.8892163 0.30409111
## sigma[3,2,1]             ATP    2         A     sigma 1.3327907 0.54662461
## sigma[3,2,2]             ATP    2         B     sigma 1.7798579 0.75163827
## sigma[3,3,1]             ATP    3         A     sigma 1.4641724 0.55022049
## sigma[3,3,2]             ATP    3         B     sigma 1.8060726 0.77105880
## sigma[3,4,1]             ATP    4         A     sigma 0.6047970 0.17846393
## sigma[3,4,2]             ATP    4         B     sigma 0.7614431 0.22969985
## sigma[4,1,1] 2-Aminomuconate    1         A     sigma 1.7075296 0.75935275
## sigma[4,1,2] 2-Aminomuconate    1         B     sigma 0.5363479 0.16081754
## sigma[4,2,1] 2-Aminomuconate    2         A     sigma 0.9056743 0.34208721
## sigma[4,2,2] 2-Aminomuconate    2         B     sigma 1.8174049 0.81085590
## sigma[4,3,1] 2-Aminomuconate    3         A     sigma 0.2401835 0.05420669
## sigma[4,3,2] 2-Aminomuconate    3         B     sigma 0.2629060 0.05944888
## sigma[4,4,1] 2-Aminomuconate    4         A     sigma 1.0811620 0.39270026
## sigma[4,4,2] 2-Aminomuconate    4         B     sigma 0.4241660 0.12290841
##                  97.5%
## sigma[1,1,1] 2.1671422
## sigma[1,1,2] 3.8405632
## sigma[1,2,1] 1.7668383
## sigma[1,2,2] 2.6807549
## sigma[1,3,1] 2.2804793
## sigma[1,3,2] 1.8532120
## sigma[1,4,1] 4.1138138
## sigma[1,4,2] 3.9297614
## sigma[2,1,1] 2.5672570
## sigma[2,1,2] 1.8474976
## sigma[2,2,1] 1.8070786
## sigma[2,2,2] 2.9066850
## sigma[2,3,1] 2.3046556
## sigma[2,3,2] 1.7465832
## sigma[2,4,1] 4.1684746
## sigma[2,4,2] 3.3670142
## sigma[3,1,1] 3.8221619
## sigma[3,1,2] 2.6828535
## sigma[3,2,1] 3.2867508
## sigma[3,2,2] 4.0833796
## sigma[3,3,1] 3.7595448
## sigma[3,3,2] 4.2080075
## sigma[3,4,1] 2.1176336
## sigma[3,4,2] 2.2520458
## sigma[4,1,1] 3.8892671
## sigma[4,1,2] 1.8139593
## sigma[4,2,1] 2.3883463
## sigma[4,2,2] 4.0760970
## sigma[4,3,1] 0.8410887
## sigma[4,3,2] 1.0971607
## sigma[4,4,1] 2.8711087
## sigma[4,4,2] 1.4385154
## 
## $lambda
##                  metabolite condition parameter      mean      2.5%    97.5%
## lambda[1,1]             PEP         A    lambda 0.8873545 0.2535478 1.960330
## lambda[1,2]             PEP         B    lambda 0.7874259 0.2220466 1.771303
## lambda[2,1]             UMP         A    lambda 0.8100720 0.2575735 1.799930
## lambda[2,2]             UMP         B    lambda 0.9188968 0.2633403 2.054432
## lambda[3,1]             ATP         A    lambda 0.7436686 0.2034361 1.697141
## lambda[3,2]             ATP         B    lambda 0.7318452 0.2066276 1.645760
## lambda[4,1] 2-Aminomuconate         A    lambda 0.8699647 0.2578516 1.859850
## lambda[4,2] 2-Aminomuconate         B    lambda 1.0582119 0.3172107 2.358795
## 
## $delta_mu
##                        metabolite condition timepoint_1 timepoint_2 parameter
## delta_mu[1,1,1,2]             PEP         A           1           2  delta_mu
## delta_mu[1,1,1,3]             PEP         A           1           3  delta_mu
## delta_mu[1,1,1,4]             PEP         A           1           4  delta_mu
## delta_mu[1,1,2,3]             PEP         A           2           3  delta_mu
## delta_mu[1,1,2,4]             PEP         A           2           4  delta_mu
## delta_mu[1,1,3,4]             PEP         A           3           4  delta_mu
## delta_mu[1,2,1,2]             PEP         B           1           2  delta_mu
## delta_mu[1,2,1,3]             PEP         B           1           3  delta_mu
## delta_mu[1,2,1,4]             PEP         B           1           4  delta_mu
## delta_mu[1,2,2,3]             PEP         B           2           3  delta_mu
## delta_mu[1,2,2,4]             PEP         B           2           4  delta_mu
## delta_mu[1,2,3,4]             PEP         B           3           4  delta_mu
## delta_mu[2,1,1,2]             UMP         A           1           2  delta_mu
## delta_mu[2,1,1,3]             UMP         A           1           3  delta_mu
## delta_mu[2,1,1,4]             UMP         A           1           4  delta_mu
## delta_mu[2,1,2,3]             UMP         A           2           3  delta_mu
## delta_mu[2,1,2,4]             UMP         A           2           4  delta_mu
## delta_mu[2,1,3,4]             UMP         A           3           4  delta_mu
## delta_mu[2,2,1,2]             UMP         B           1           2  delta_mu
## delta_mu[2,2,1,3]             UMP         B           1           3  delta_mu
## delta_mu[2,2,1,4]             UMP         B           1           4  delta_mu
## delta_mu[2,2,2,3]             UMP         B           2           3  delta_mu
## delta_mu[2,2,2,4]             UMP         B           2           4  delta_mu
## delta_mu[2,2,3,4]             UMP         B           3           4  delta_mu
## delta_mu[3,1,1,2]             ATP         A           1           2  delta_mu
## delta_mu[3,1,1,3]             ATP         A           1           3  delta_mu
## delta_mu[3,1,1,4]             ATP         A           1           4  delta_mu
## delta_mu[3,1,2,3]             ATP         A           2           3  delta_mu
## delta_mu[3,1,2,4]             ATP         A           2           4  delta_mu
## delta_mu[3,1,3,4]             ATP         A           3           4  delta_mu
## delta_mu[3,2,1,2]             ATP         B           1           2  delta_mu
## delta_mu[3,2,1,3]             ATP         B           1           3  delta_mu
## delta_mu[3,2,1,4]             ATP         B           1           4  delta_mu
## delta_mu[3,2,2,3]             ATP         B           2           3  delta_mu
## delta_mu[3,2,2,4]             ATP         B           2           4  delta_mu
## delta_mu[3,2,3,4]             ATP         B           3           4  delta_mu
## delta_mu[4,1,1,2] 2-Aminomuconate         A           1           2  delta_mu
## delta_mu[4,1,1,3] 2-Aminomuconate         A           1           3  delta_mu
## delta_mu[4,1,1,4] 2-Aminomuconate         A           1           4  delta_mu
## delta_mu[4,1,2,3] 2-Aminomuconate         A           2           3  delta_mu
## delta_mu[4,1,2,4] 2-Aminomuconate         A           2           4  delta_mu
## delta_mu[4,1,3,4] 2-Aminomuconate         A           3           4  delta_mu
## delta_mu[4,2,1,2] 2-Aminomuconate         B           1           2  delta_mu
## delta_mu[4,2,1,3] 2-Aminomuconate         B           1           3  delta_mu
## delta_mu[4,2,1,4] 2-Aminomuconate         B           1           4  delta_mu
## delta_mu[4,2,2,3] 2-Aminomuconate         B           2           3  delta_mu
## delta_mu[4,2,2,4] 2-Aminomuconate         B           2           4  delta_mu
## delta_mu[4,2,3,4] 2-Aminomuconate         B           3           4  delta_mu
##                           mean        2.5%      97.5%
## delta_mu[1,1,1,2] -1.450861238 -2.61234281 -0.1060866
## delta_mu[1,1,1,3] -1.344561293 -2.87697457  0.2266643
## delta_mu[1,1,1,4] -1.112499406 -3.05716137  1.2266849
## delta_mu[1,1,2,3]  0.106299945 -1.33829611  1.3238930
## delta_mu[1,1,2,4]  0.338361832 -1.59949233  2.3298731
## delta_mu[1,1,3,4]  0.232061887 -1.74426988  2.3313695
## delta_mu[1,2,1,2] -0.813777626 -2.87232465  1.2332306
## delta_mu[1,2,1,3] -0.867147583 -2.55033868  0.9450247
## delta_mu[1,2,1,4] -1.140788672 -3.34453125  1.0520666
## delta_mu[1,2,2,3] -0.053369957 -1.48875124  1.5156161
## delta_mu[1,2,2,4] -0.327011046 -2.52006808  1.8381351
## delta_mu[1,2,3,4] -0.273641089 -2.25969181  1.4834784
## delta_mu[2,1,1,2] -0.478318626 -1.89363173  0.8212456
## delta_mu[2,1,1,3]  0.313866372 -1.23695071  1.9328646
## delta_mu[2,1,1,4] -0.662306585 -2.95896555  1.6778169
## delta_mu[2,1,2,3]  0.792184998 -0.53710951  2.0832463
## delta_mu[2,1,2,4] -0.183987959 -2.23447796  1.9417981
## delta_mu[2,1,3,4] -0.976172957 -3.23558048  1.1910606
## delta_mu[2,2,1,2]  0.582443524 -1.03199257  2.1366092
## delta_mu[2,2,1,3] -0.376823853 -1.48965778  0.8695545
## delta_mu[2,2,1,4] -1.245972336 -2.83097595  0.5979581
## delta_mu[2,2,2,3] -0.959267377 -2.44769416  0.6503955
## delta_mu[2,2,2,4] -1.828415860 -3.71690652  0.3275042
## delta_mu[2,2,3,4] -0.869148483 -2.36441039  0.8913739
## delta_mu[3,1,1,2] -0.953378421 -3.18711996  1.5559959
## delta_mu[3,1,1,3] -0.034162853 -2.49628078  2.2948841
## delta_mu[3,1,1,4]  0.241090173 -1.75061165  2.1208305
## delta_mu[3,1,2,3]  0.919215567 -1.22797602  3.1045049
## delta_mu[3,1,2,4]  1.194468594 -0.70832791  2.8251219
## delta_mu[3,1,3,4]  0.275253027 -1.40481782  1.9929387
## delta_mu[3,2,1,2] -0.441030479 -2.47494431  1.9051106
## delta_mu[3,2,1,3]  0.007325306 -2.37867811  2.2729069
## delta_mu[3,2,1,4]  0.344951729 -1.12058824  2.0109705
## delta_mu[3,2,2,3]  0.448355786 -2.26794198  2.9327964
## delta_mu[3,2,2,4]  0.785982208 -1.21398481  2.6620279
## delta_mu[3,2,3,4]  0.337626423 -1.58148981  2.5076446
## delta_mu[4,1,1,2] -1.454777727 -3.44572524  0.6320860
## delta_mu[4,1,1,3] -0.251479971 -2.01592571  1.6847207
## delta_mu[4,1,1,4] -0.685179679 -2.87573504  1.5875167
## delta_mu[4,1,2,3]  1.203297756 -0.06366922  2.2476298
## delta_mu[4,1,2,4]  0.769598048 -0.92589804  2.4688473
## delta_mu[4,1,3,4] -0.433699708 -1.75886171  0.9268542
## delta_mu[4,2,1,2]  1.469792237 -0.94154065  3.3499619
## delta_mu[4,2,1,3]  0.699002429 -0.35864082  1.5471064
## delta_mu[4,2,1,4]  0.694973117 -0.58810655  1.5658933
## delta_mu[4,2,2,3] -0.770789808 -2.57757167  1.2782887
## delta_mu[4,2,2,4] -0.774819120 -2.64311617  1.2299452
## delta_mu[4,2,3,4] -0.004029312 -0.75102888  0.7299931
## 
## $euclidean_distances
##                                metabolite condition_1 condition_2
## euclidean_distance[1,1,2]             PEP           A           B
## euclidean_distance[2,1,2]             UMP           A           B
## euclidean_distance[3,1,2]             ATP           A           B
## euclidean_distance[4,1,2] 2-Aminomuconate           A           B
##                                    parameter     mean      2.5%    97.5%
## euclidean_distance[1,1,2] euclidean_distance 1.754368 0.5575603 3.746509
## euclidean_distance[2,1,2] euclidean_distance 2.103234 0.8214704 3.843347
## euclidean_distance[3,1,2] euclidean_distance 1.906488 0.6045208 4.012833
## euclidean_distance[4,1,2] euclidean_distance 2.523609 0.9837851 4.406038

With the output from the estimates_dynamics function we can cluster metabolite dynamics per experimental condition.

cluster <- # clustering results have to be stored in separate object when using data frame as input
  cluster_dynamics(
    estimates = estimates, # data is now the estimates or a data frame ob similar structure
    fit = fit,
    distance = "euclidean", # which distance method should be used
    agglomeration = "ward.D2", # which agglomeration method for hierarchical clustering should be used
    deepSplit = 2, # sensitivity of cluster analysis,
    minClusterSize = 1, # minimum number of metabolites in one cluster
    B = 10, # number of bootstrapps
    )

##  ..cutHeight not given, setting it to 1.19  ===>  99% of the (truncated) height range in dendro.
##  ..done.
##  ..cutHeight not given, setting it to 1.84  ===>  99% of the (truncated) height range in dendro.
##  ..done.

To visualize the clustering results we can use the function plot_cluster which returns a list of plots. For details see Vignette “MetaboDynamics”.

plots <- plot_cluster(cluster)

## Scale for y is already present.
## Adding another scale for y, which will replace the existing scale.
## Scale for y is already present.
## Adding another scale for y, which will replace the existing scale.

plots$trees$A

plots$clusterplots$A

plots$lineplots$A

plots$patchwork$A

7 Over-Representation Analysis (ORA)

To conduct the over-representation analysis KEGG IDs of the experimental metabolites are needed. Additionally, information on all metabolites that are stored in the KEGG database is needed. This KEGG background information is stored in the data set “modules_compounds”. To apply ORA to a specific dataset a second dataset “metabolite_modules” is needed that can be obtained by filtering “modules_compounds” for the experimental metabolites.

The clustering result also needs to be converted from a list to a single data frame.

# load background dataset
data("modules_compounds")
data("metabolite_modules")
# get KEGGs from data
data("IDs")

The over-representation analysis and visualization can be achieved with the following code:

ora <- # store ORA result to separate object when using data frames as input
  ORA_hypergeometric(
    background = modules_compounds,
    annotations = metabolite_modules,
    data = cluster, # dataframe format of clustering output
    tested_column = "middle_hierarchy",
    IDs = IDs
  )

# Visualization
plot_ORA(
  data = ora,
  tested_column = "middle_hierarchy"
)

## $plot

## Warning in min(x): no non-missing arguments to min; returning Inf

## Warning in max(x): no non-missing arguments to max; returning -Inf

## Warning in min(x): no non-missing arguments to min; returning Inf

## Warning in max(x): no non-missing arguments to max; returning -Inf

## 
## $ora_patchwork
## list()

8 Compare dynamics clusters

For the comparison of dynamics clusters between experimental conditions one can employ two apporaches: comparing dynamics between clusters and comparing metabolite composition between clusters.

8.1 Dynamics

To compare dynamics between clusters awhile using a data frame as input use the following code:

comparison_dynamics <- # result needs to be stored in a separate object when using data frames
  compare_dynamics(
    data = cluster,
    # clustering result
    cores = 1 # only set to 1 for vignette, can be increased to 4 for parallelization
  )

## 
## SAMPLING FOR MODEL 'm_cluster_distances_padded' NOW (CHAIN 1).
## Chain 1: 
## Chain 1: Gradient evaluation took 1.4e-05 seconds
## Chain 1: 1000 transitions using 10 leapfrog steps per transition would take 0.14 seconds.
## Chain 1: Adjust your expectations accordingly!
## Chain 1: 
## Chain 1: 
## Chain 1: Iteration:    1 / 2000 [  0%]  (Warmup)
## Chain 1: Iteration:  200 / 2000 [ 10%]  (Warmup)
## Chain 1: Iteration:  400 / 2000 [ 20%]  (Warmup)
## Chain 1: Iteration:  501 / 2000 [ 25%]  (Sampling)
## Chain 1: Iteration:  700 / 2000 [ 35%]  (Sampling)
## Chain 1: Iteration:  900 / 2000 [ 45%]  (Sampling)
## Chain 1: Iteration: 1100 / 2000 [ 55%]  (Sampling)
## Chain 1: Iteration: 1300 / 2000 [ 65%]  (Sampling)
## Chain 1: Iteration: 1500 / 2000 [ 75%]  (Sampling)
## Chain 1: Iteration: 1700 / 2000 [ 85%]  (Sampling)
## Chain 1: Iteration: 1900 / 2000 [ 95%]  (Sampling)
## Chain 1: Iteration: 2000 / 2000 [100%]  (Sampling)
## Chain 1: 
## Chain 1:  Elapsed Time: 0.042 seconds (Warm-up)
## Chain 1:                0.094 seconds (Sampling)
## Chain 1:                0.136 seconds (Total)
## Chain 1: 
## 
## SAMPLING FOR MODEL 'm_cluster_distances_padded' NOW (CHAIN 2).
## Chain 2: 
## Chain 2: Gradient evaluation took 9e-06 seconds
## Chain 2: 1000 transitions using 10 leapfrog steps per transition would take 0.09 seconds.
## Chain 2: Adjust your expectations accordingly!
## Chain 2: 
## Chain 2: 
## Chain 2: Iteration:    1 / 2000 [  0%]  (Warmup)
## Chain 2: Iteration:  200 / 2000 [ 10%]  (Warmup)
## Chain 2: Iteration:  400 / 2000 [ 20%]  (Warmup)
## Chain 2: Iteration:  501 / 2000 [ 25%]  (Sampling)
## Chain 2: Iteration:  700 / 2000 [ 35%]  (Sampling)
## Chain 2: Iteration:  900 / 2000 [ 45%]  (Sampling)
## Chain 2: Iteration: 1100 / 2000 [ 55%]  (Sampling)
## Chain 2: Iteration: 1300 / 2000 [ 65%]  (Sampling)
## Chain 2: Iteration: 1500 / 2000 [ 75%]  (Sampling)
## Chain 2: Iteration: 1700 / 2000 [ 85%]  (Sampling)
## Chain 2: Iteration: 1900 / 2000 [ 95%]  (Sampling)
## Chain 2: Iteration: 2000 / 2000 [100%]  (Sampling)
## Chain 2: 
## Chain 2:  Elapsed Time: 0.048 seconds (Warm-up)
## Chain 2:                0.093 seconds (Sampling)
## Chain 2:                0.141 seconds (Total)
## Chain 2: 
## 
## SAMPLING FOR MODEL 'm_cluster_distances_padded' NOW (CHAIN 3).
## Chain 3: 
## Chain 3: Gradient evaluation took 8e-06 seconds
## Chain 3: 1000 transitions using 10 leapfrog steps per transition would take 0.08 seconds.
## Chain 3: Adjust your expectations accordingly!
## Chain 3: 
## Chain 3: 
## Chain 3: Iteration:    1 / 2000 [  0%]  (Warmup)
## Chain 3: Iteration:  200 / 2000 [ 10%]  (Warmup)
## Chain 3: Iteration:  400 / 2000 [ 20%]  (Warmup)
## Chain 3: Iteration:  501 / 2000 [ 25%]  (Sampling)
## Chain 3: Iteration:  700 / 2000 [ 35%]  (Sampling)
## Chain 3: Iteration:  900 / 2000 [ 45%]  (Sampling)
## Chain 3: Iteration: 1100 / 2000 [ 55%]  (Sampling)
## Chain 3: Iteration: 1300 / 2000 [ 65%]  (Sampling)
## Chain 3: Iteration: 1500 / 2000 [ 75%]  (Sampling)
## Chain 3: Iteration: 1700 / 2000 [ 85%]  (Sampling)
## Chain 3: Iteration: 1900 / 2000 [ 95%]  (Sampling)
## Chain 3: Iteration: 2000 / 2000 [100%]  (Sampling)
## Chain 3: 
## Chain 3:  Elapsed Time: 0.045 seconds (Warm-up)
## Chain 3:                0.096 seconds (Sampling)
## Chain 3:                0.141 seconds (Total)
## Chain 3: 
## 
## SAMPLING FOR MODEL 'm_cluster_distances_padded' NOW (CHAIN 4).
## Chain 4: 
## Chain 4: Gradient evaluation took 7e-06 seconds
## Chain 4: 1000 transitions using 10 leapfrog steps per transition would take 0.07 seconds.
## Chain 4: Adjust your expectations accordingly!
## Chain 4: 
## Chain 4: 
## Chain 4: Iteration:    1 / 2000 [  0%]  (Warmup)
## Chain 4: Iteration:  200 / 2000 [ 10%]  (Warmup)
## Chain 4: Iteration:  400 / 2000 [ 20%]  (Warmup)
## Chain 4: Iteration:  501 / 2000 [ 25%]  (Sampling)
## Chain 4: Iteration:  700 / 2000 [ 35%]  (Sampling)
## Chain 4: Iteration:  900 / 2000 [ 45%]  (Sampling)
## Chain 4: Iteration: 1100 / 2000 [ 55%]  (Sampling)
## Chain 4: Iteration: 1300 / 2000 [ 65%]  (Sampling)
## Chain 4: Iteration: 1500 / 2000 [ 75%]  (Sampling)
## Chain 4: Iteration: 1700 / 2000 [ 85%]  (Sampling)
## Chain 4: Iteration: 1900 / 2000 [ 95%]  (Sampling)
## Chain 4: Iteration: 2000 / 2000 [100%]  (Sampling)
## Chain 4: 
## Chain 4:  Elapsed Time: 0.043 seconds (Warm-up)
## Chain 4:                0.169 seconds (Sampling)
## Chain 4:                0.212 seconds (Total)
## Chain 4:

## Warning: There were 5 divergent transitions after warmup. See
## https://mc-stan.org/misc/warnings.html#divergent-transitions-after-warmup
## to find out why this is a problem and how to eliminate them.

## Warning: Examine the pairs() plot to diagnose sampling problems

The data frame needed for visualization of the results is the list element [[“estimates”]] of the function results. To visualize the results run the following code:

# Visualize comparison results
heatmap_dynamics(
  estimates = comparison_dynamics[["estimates"]],
  data = cluster
)

8.2 Metabolites

The comparison of metabolite composition follows the same principle as the comparison of dynamics between clusters:

# compare metabolite composition
compare_metabolites <-
  compare_metabolites(
    data = cluster
  )

# Visualization
heatmap_metabolites(
  distances = compare_metabolites,
  data = cluster
)

8.3 Combine both

The combination of both comparisons may facilitate detection of differences of longitudinal metabolomes between experimental conditions.

# combine comparison results
temp <- left_join(
  comparison_dynamics[["estimates"]], # dynamics comparison
  compare_metabolites,
  join_by("cluster_a", "cluster_b") # join by cluster comparisons
)

# get unique clusters
x <- unique(c(temp[, "cluster_a"], temp[, "cluster_b"]))

# draw plot
ggplot(temp, aes(x = cluster_b, y = cluster_a)) +
  geom_point(aes(size = Jaccard, col = mu_mean)) +
  theme_bw() +
  scale_color_viridis_c(option = "magma") +
  scale_x_discrete(limits = x) +
  xlab("") +
  ylab("") +
  scale_y_discrete(limits = x) +
  theme(axis.text.x = element_text(angle = 90, vjust = 0.5, hjust = 1)) +
  labs(col = "dynamics distance", size = "metabolite similarity") +
  ggtitle("comparison of clusters", "label = condition + cluster ID")

sessionInfo()

## R Under development (unstable) (2025-10-20 r88955)
## Platform: x86_64-pc-linux-gnu
## Running under: Ubuntu 24.04.3 LTS
## 
## Matrix products: default
## BLAS:   /home/biocbuild/bbs-3.23-bioc/R/lib/libRblas.so 
## LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.12.0  LAPACK version 3.12.0
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_GB              LC_COLLATE=C              
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
## 
## time zone: America/New_York
## tzcode source: system (glibc)
## 
## attached base packages:
## [1] stats4    stats     graphics  grDevices utils     datasets  methods  
## [8] base     
## 
## other attached packages:
##  [1] patchwork_1.3.2             tidyr_1.3.1                
##  [3] dplyr_1.1.4                 ggplot2_4.0.0              
##  [5] SummarizedExperiment_1.41.0 Biobase_2.71.0             
##  [7] GenomicRanges_1.63.0        Seqinfo_1.1.0              
##  [9] IRanges_2.45.0              S4Vectors_0.49.0           
## [11] BiocGenerics_0.57.0         generics_0.1.4             
## [13] MatrixGenerics_1.23.0       matrixStats_1.5.0          
## [15] MetaboDynamics_2.1.0        BiocStyle_2.39.0           
## 
## loaded via a namespace (and not attached):
##  [1] gridExtra_2.3           inline_0.3.21           rlang_1.1.6            
##  [4] magrittr_2.0.4          compiler_4.6.0          loo_2.8.0              
##  [7] systemfonts_1.3.1       png_0.1-8               vctrs_0.6.5            
## [10] stringr_1.5.2           pkgconfig_2.0.3         crayon_1.5.3           
## [13] fastmap_1.2.0           magick_2.9.0            XVector_0.51.0         
## [16] labeling_0.4.3          utf8_1.2.6              rmarkdown_2.30         
## [19] tinytex_0.57            purrr_1.1.0             xfun_0.54              
## [22] cachem_1.1.0            aplot_0.2.9             jsonlite_2.0.0         
## [25] DelayedArray_0.37.0     parallel_4.6.0          R6_2.6.1               
## [28] bslib_0.9.0             stringi_1.8.7           RColorBrewer_1.1-3     
## [31] StanHeaders_2.32.10     jquerylib_0.1.4         Rcpp_1.1.0             
## [34] bookdown_0.45           rstan_2.32.7            knitr_1.50             
## [37] Matrix_1.7-4            tidyselect_1.2.1        dichromat_2.0-0.1      
## [40] abind_1.4-8             yaml_2.3.10             codetools_0.2-20       
## [43] curl_7.0.0              pkgbuild_1.4.8          lattice_0.22-7         
## [46] tibble_3.3.0            treeio_1.35.0           withr_3.0.2            
## [49] KEGGREST_1.51.0         S7_0.2.0                evaluate_1.0.5         
## [52] gridGraphics_0.5-1      RcppParallel_5.1.11-1   Biostrings_2.79.1      
## [55] pillar_1.11.1           BiocManager_1.30.26     ggtree_4.1.1           
## [58] ggfun_0.2.0             rstantools_2.5.0        scales_1.4.0           
## [61] tidytree_0.4.6          glue_1.8.0              gdtools_0.4.4          
## [64] lazyeval_0.2.2          tools_4.6.0             ggiraph_0.9.2          
## [67] fs_1.6.6                grid_4.6.0              ape_5.8-1              
## [70] QuickJSR_1.8.1          nlme_3.1-168            cli_3.6.5              
## [73] rappdirs_0.3.3          fontBitstreamVera_0.1.1 S4Arrays_1.11.0        
## [76] viridisLite_0.4.2       V8_8.0.1                gtable_0.3.6           
## [79] yulab.utils_0.2.1       dynamicTreeCut_1.63-1   sass_0.4.10            
## [82] digest_0.6.37           fontquiver_0.2.1        SparseArray_1.11.1     
## [85] ggplotify_0.1.3         htmlwidgets_1.6.4       farver_2.1.2           
## [88] htmltools_0.5.8.1       lifecycle_1.0.4         httr_1.4.7             
## [91] fontLiberation_0.1.0

Using MetaboDynamics with data frames

31 October 2025

Package

Contents