estimraw

The goal of estimraw is to four-fold table cell frequencies (raw data) from risk ratios (relative risks), risk differences and odds ratios. While raw data can be useful for doing meta-analysis, such data is often not provided by primary studies (with summary statistics being solely presented). Therefore, based on summary statistics (namely, risk ratios, risk differences and odds ratios), this package estimates the value of each cell in a 2x2 table according to the equations described in Di Pietrantonj C (2006) (https://onlinelibrary.wiley.com/doi/abs/10.1002/sim.2287).

The following notation is used (see also table below):

Event No event Total
Exposed a b m1
Not exposed c d m2
Total e1

Estimating raw data from risk differences and odds ratio involves application of the quadratic formula. Therefore, for such effect sizes, two results sets (“solutions”) may be presented, particularly, when there is no information on the total number of participants developing the outcome event (e1). In those cases, the researcher must choose the most adequate solution based on prior clinical knowledge.

The precision with which effect sizes are provided conditions the uncertainty underlying raw data estimation. For example, effect sizes ranging from 1.45 to 1.54 can be reported as ‘1.5’. Therefore, when e1 is unknown, this function provides not only point estimates based on exact input values, but also a range of compatible values for a, b, c and d (considering possible roundings of effect sizes). When e1 is known, this function only provides the result(s) in which a+c=e1.

Installation

You can install the released version of estimraw from CRAN with:

install.packages("estimraw")

Getting started

estimraw allows for the use of estim_rawfunction, which has eight arguments:

The following outputs are obtained:

Examples

Example 1 - low effect size precision and no information on the number of participants developing the outcome of event:

Consider a primary study presenting the Risk Ratio as 0.6, with a 95% confidence interval of 0.4-0.9 (worked example 3.1. of Di Petrantonj C (Statistics in Medicine 2006;25:2299-2322)). Let us assume that (i) We know that there are 352 participants in the treated/exposed group, and 376 participants in the control group; (ii) we do not know the total number of participants developing the outcome of event.

Raw data can be estimated by:

estim_rr <- estim_raw(es=0.6,lb=0.4,ub=0.9,m1=352,m2=376,dec=1,measure="rr")
estim_rr

The results indicate that the value of a (corresponding to the number of treated patients who develop the outcome event) lies between 22 and 47, and that the value of c (corresponding to the number of control patients who develop the outcome event) lies between 38 and 87. This large range reflects the low estimate precision and lack of information on the number of participants developing the outcome event. The real values are actually 37 and 65.

Example 2 - high effect size precision and no information on the number of participants developing the outcome of event:

Consider a primary study presenting the Odds Ratio as 0.6207, with a 95% confidence interval of 0.3382-1.1391 (worked example 4.2. of Di Pietrantonj C (Statistics in Medicine 2006;25:2299-2322)). Let us assume that (i) We know that there are 355 participants in the treated/exposed group, and 366 participants in the control group; (ii) we do not know the total number of participants developing the outcome of event.

Raw data can be estimated by:

estim_or1 <- estim_raw(es=0.6207,lb=0.3382,ub=1.1391,m1=355,m2=366,dec=4,measure="or")
estim_or1

The results indicate that the values of a and c are either respectively 18 and 29 (solution 1) or 327 and 348 (solution 2). These two solutions are presented since the quadratic formula had been applied to estimate raw data. The correct solution should be assumed based on prior clinical knowledge (i.e., whether the outcome event is a probable one or not). The real values are actually a=18 and c=29.

Example 3 - low effect size precision and information on the number of participants developing the outcome of event:

Consider a primary study presenting the Odds Ratio as 0.6, with a 95% confidence interval of 0.3-1.1 (worked example 4.2. of Di Pietrantonj C (Statistics in Medicine 2006;25:2299-2322)). Let us assume that (i) We know that there are 355 participants in the treated/exposed group, and 366 participants in the control group; (ii) we know that, overall, 47 participants developed the outcome of event.

Raw data can be estimated by:

estim_or2 <- estim_raw(es=0.6,lb=0.3,ub=1.1,m1=355,m2=366,e1=47,dec=1,measure="or")
estim_or2

The results indicate that the values of a and c are either respectively 17 and 30 or 18 and 29. The real values are actually a=18 and c=29.

Example 4 - high effect size precision and information on the number of participants developing the outcome of event:

Consider a primary study presenting the risk difference as -7.83%, with a 95% confidence interval of -13.87;-1.8% (worked example 2.2. of Di Pietrantonj C (Statistics in Medicine 2006;25:2299-2322)). Let us assume that (i) We know that there are 373 participants in the treated/exposed group, and 357 participants in the control group; (ii) we know that, overall, 163 participants developed the outcome of event.

Raw data can be estimated by:

estim_rd <- estim_raw(es=-0.0783,lb=-0.1387,ub=-0.018,m1=373,m2=357,e1=163,dec=4,measure="rd")
estim_rd

The results indicate that the values of a and c are respectively 69 and 94. These actually correspond to the real values of a and c.

References