Title: | Find Differences Between R Objects |
---|---|
Description: | Compare complex R objects and reveal the key differences. Designed particularly for use in testing packages where being able to quickly isolate key differences makes understanding test failures much easier. |
Authors: | Hadley Wickham [aut, cre], Posit Software, PBC [cph, fnd] |
Maintainer: | Hadley Wickham <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.6.1.9000 |
Built: | 2024-12-07 06:25:35 UTC |
Source: | https://github.com/r-lib/waldo |
This compares two R objects, identifying the key differences. It:
Orders the differences from most important to least important.
Displays the values of atomic vectors that are actually different.
Carefully uses colour to emphasise changes (while still being readable when colour isn't available).
Uses R code (not a text description) to show where differences arise.
Where possible, it compares elements by name, rather than by position.
Errs on the side of producing too much output, rather than too little.
compare()
is an alternative to all.equal()
.
compare( x, y, ..., x_arg = "old", y_arg = "new", tolerance = NULL, max_diffs = if (in_ci()) Inf else 10, ignore_srcref = TRUE, ignore_attr = "waldo_opts", ignore_encoding = TRUE, ignore_function_env = FALSE, ignore_formula_env = FALSE, list_as_map = FALSE, quote_strings = TRUE )
compare( x, y, ..., x_arg = "old", y_arg = "new", tolerance = NULL, max_diffs = if (in_ci()) Inf else 10, ignore_srcref = TRUE, ignore_attr = "waldo_opts", ignore_encoding = TRUE, ignore_function_env = FALSE, ignore_formula_env = FALSE, list_as_map = FALSE, quote_strings = TRUE )
x , y
|
Objects to compare. |
... |
A handful of other arguments are supported with a warning for backward comparability. These include:
All other arguments are ignored with a warning. |
x_arg , y_arg
|
Name of |
tolerance |
If non- It uses the same algorithm as |
max_diffs |
Control the maximum number of differences shown. The
default shows 10 differences when run interactively and all differences
when run in CI. Set |
ignore_srcref |
Ignore differences in function |
ignore_attr |
Ignore differences in specified attributes?
Supply a character vector to ignore differences in named attributes.
By default the For backward compatibility with |
ignore_encoding |
Ignore string encoding? |
ignore_function_env , ignore_formula_env
|
Ignore the environments of
functions and formulas, respectively? These are provided primarily for
backward compatibility with |
list_as_map |
Compare lists as if they are mappings between names and
values. Concretely, this drops |
quote_strings |
Should strings be surrounded by quotes? If |
A character vector with class "waldo_compare". If there are no differences it will have length 0; otherwise each element contains the description of a single difference.
There are two ways for an object (rather than the person calling compare()
or expect_equal()
to control how it is compared to other objects.
First, if the object has an S3 class, you can provide a compare_proxy()
method that provides an alternative representation of the object; this is
particularly useful if important data is stored outside of R, e.g. in
an external pointer.
Alternatively, you can attach an attribute called "waldo_opts"
to your
object. This should be a list of compare options, using the same names
and possible values as the arguments to this function. This option
is ignored by default (ignore_attr
) so that you can set the options in
the object that you control. (If you don't want to see the attributes
interactively, you could attach them in a compare_proxy()
method.)
Options supplied in this way also affect all the children. This means options are applied in the following order, from lowest to highest precedence:
Defaults from compare()
.
The waldo_opts
for the parents of x
.
The waldo_opts
for the parents of y
.
The waldo_opts
for x
.
The waldo_opts
for y
.
User-specified arguments to compare()
.
Use these techniques with care. If you accidentally cover up an important
difference you can create a confusing situation where x
and y
behave
differently but compare()
reports no differences in the underlying objects.
# Thanks to diffobj package comparison of atomic vectors shows differences # with a little context compare(letters, c("z", letters[-26])) compare(c(1, 2, 3), c(1, 3)) compare(c(1, 2, 3), c(1, 3, 4, 5)) compare(c(1, 2, 3), c(1, 2, 5)) # More complex objects are traversed, stopping only when the types are # different compare( list(x = list(y = list(structure(1, z = 2)))), list(x = list(y = list(structure(1, z = "a")))) ) # Where possible, recursive structures are compared by name compare(iris, rev(iris)) compare(list(x = "x", y = "y"), list(y = "y", x = "x")) # Otherwise they're compared by position compare(list("x", "y"), list("x", "z")) compare(list(x = "x", x = "y"), list(x = "x", y = "z"))
# Thanks to diffobj package comparison of atomic vectors shows differences # with a little context compare(letters, c("z", letters[-26])) compare(c(1, 2, 3), c(1, 3)) compare(c(1, 2, 3), c(1, 3, 4, 5)) compare(c(1, 2, 3), c(1, 2, 5)) # More complex objects are traversed, stopping only when the types are # different compare( list(x = list(y = list(structure(1, z = 2)))), list(x = list(y = list(structure(1, z = "a")))) ) # Where possible, recursive structures are compared by name compare(iris, rev(iris)) compare(list(x = "x", y = "y"), list(y = "y", x = "x")) # Otherwise they're compared by position compare(list("x", "y"), list("x", "z")) compare(list(x = "x", x = "y"), list(x = "x", y = "z"))
Use this generic to override waldo's default comparison if you need to override the defaults (typically because your object stores data in an external pointer).
waldo comes with methods for a few common cases:
data.table: the .internal.selfref
and index
attributes
are set to NULL
. Both attributes are used for performance optimisation, and
don't affect the data.
xml2::xml_node
: the underlying XML data is stored in memory in C,
behind an external pointer, so the we best can do is to convert the
object to a string.
Classes from the RProtoBuf
package: like XML objects, these store
data in memory in C++ and only expose string names to R. Fortunately,
these have well-understood string representations that we can use for
comparisons. See
https://protobuf.dev/reference/cpp/api-docs/google.protobuf.text_format/
compare_proxy(x, path = "x")
compare_proxy(x, path = "x")
x |
An object. |
path |
Path |
A list with two components:
object
: the modified object
path
: an updated path showing what modification was applied