Title: | Explore the Innards of 'ggplot2' Objects |
---|---|
Description: | Extensions to 'ggplot2' providing low-level debug tools: statistics and geometries echoing their data argument. Layer manipulation: deletion, insertion, extraction and reordering of layers. Deletion of unused variables from the data object embedded in "ggplot" objects. |
Authors: | Pedro J. Aphalo [aut, cre] |
Maintainer: | Pedro J. Aphalo <[email protected]> |
License: | GPL (>= 2) |
Version: | 0.2.0-1 |
Built: | 2024-11-13 23:15:29 UTC |
Source: | https://github.com/aphalo/gginnards |
Extensions to 'ggplot2' providing low-level debug tools: statistics and geometries echoing their data argument. Layer manipulation: deletion, insertion, extraction and reordering of layers. Deletion of unused variables from the data object embedded in "ggplot" objects.
The new facilities for cleanly defining new stats and geoms added to package 'ggplot2' in version 2.0.0 gave origin to this package. I needed tools to help me learn how layers work and to debug the extensions to 'ggplot2' that I was developing. I share them through this package in the hope that they will help other users of 'ggplot2' understand how this this very popular graphics package works internally. The vignettes provide examples of how to use these tools both for debugging and learning how ggplots are stored.
Extensions provided:
"Debug" stats and a "debug" geom that print to the console a summary
of their data
input.
Functions for inspecting and manipulating the list of layers of a ggplot object.
Functions for exploring and manipulating the data embedded in ggplot objects, including dropping unused variables.
Maintainer: Pedro J. Aphalo [email protected] (ORCID)
Package 'tidyverse' web site at https://www.tidyverse.org/
Package 'ggplot2' documentation at https://ggplot2.tidyverse.org/
Package 'ggplot2' source code at https://github.com/tidyverse/ggplot2
Useful links:
Report bugs at https://github.com/aphalo/gginnards/issues
# echo the 'data' and 'params' as received by 'draw_panel()' ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel() # echo the 'data' and 'params' as received by 'draw_group()' ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_group() # echo the 'data' and 'params' as received by 'draw_panel()' ggplot(mtcars, aes(cyl, mpg, colour = factor(cyl))) + stat_summary(fun.data = "mean_se") + stat_summary(fun.data = "mean_se", geom = "debug_panel") # echo the 'data' received by 'compute_panel()' ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + stat_debug_panel() # echo the 'data' received by 'compute_group()' ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + stat_debug_group()
# echo the 'data' and 'params' as received by 'draw_panel()' ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel() # echo the 'data' and 'params' as received by 'draw_group()' ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_group() # echo the 'data' and 'params' as received by 'draw_panel()' ggplot(mtcars, aes(cyl, mpg, colour = factor(cyl))) + stat_summary(fun.data = "mean_se") + stat_summary(fun.data = "mean_se", geom = "debug_panel") # echo the 'data' received by 'compute_panel()' ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + stat_debug_panel() # echo the 'data' received by 'compute_group()' ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + stat_debug_group()
Delete, move or append one or more layers in a ggplot object.
delete_layers(x, match_type = NULL, idx = NULL) append_layers(x, object, position = "top") move_layers(x, match_type = NULL, position = "top", idx = NULL) shift_layers(x, match_type = NULL, idx = NULL, shift = 1L) which_layers(x, match_type = NULL, idx = NULL) extract_layers(x, match_type = NULL, idx = NULL) top_layer(x) bottom_layer(x) num_layers(x)
delete_layers(x, match_type = NULL, idx = NULL) append_layers(x, object, position = "top") move_layers(x, match_type = NULL, position = "top", idx = NULL) shift_layers(x, match_type = NULL, idx = NULL, shift = 1L) which_layers(x, match_type = NULL, idx = NULL) extract_layers(x, match_type = NULL, idx = NULL) top_layer(x) bottom_layer(x) num_layers(x)
x |
an object of class |
match_type |
The name of the ggproto object class for the geom(s), position(s) or stat(s) matching that of the layers to be operated upon. |
idx |
integer vector Index into the list of layers used to select the layers to be operated upon. |
object |
a ggplot layer created by a |
position |
character or interger, the position of the layer immediately above of which to move or append the moved or appended layers. |
shift |
integer. |
These functions must be used with care as they select all layers matching the provided geom, position or stat ggproto object class. Layers added with a stat do use a geom, and vice versa.
One and only one of match_type
and idx
must be passed a
non-null argument.
In plots with several layers, it is possible that more than one layer
matches the class name passed to match_type
. It is also possible to
pass a numeric vector with multiple indexes through parameter idx
.
In both cases multiple layers will be operated upon, but their relative
positions will remain unchanged.
If a numeric vector with multiple position indexes is supplied as argument
for position
, the topmost position will be used. As indexing in R
starts at 1, passing 0 or "bottom"
as argument for position
puts the moved or appended layer(s) behind all other layers (prepends the
layer).
An edited copy of x
for delete_layers
,
append_layers
and move_layers
. An integer vector of indexes
giving the positions of the matching layers in the list of layers contained
in x
in the case of which_layers
.
The functions described here are not expected to be useful in everyday plotting as one can more easily change the order in which layers are added to a ggplot. However, if one uses high level methods or functions that automatically produce a full plot using 'ggplot2' internally, one may need to add, move or delete layers so as to profit from such canned methods and retain enough flexibility.
https://stackoverflow.com/questions/13407236/remove-a-layer-from-a-ggplot2-chart
df <- data.frame( gp = factor(rep(letters[1:3], each = 10)), y = rnorm(30) ) p <- ggplot(df, aes(gp, y)) + geom_point() + stat_summary(fun.data = "mean_se", colour = "red") p delete_layers(p, "GeomPoint") delete_layers(p, "StatSummary") move_layers(p, "GeomPoint", position = "top") move_layers(p, "GeomPointrange", position = "bottom") move_layers(p, "StatSummary", position = "bottom") move_layers(p, "GeomPointrange", position = 1L) append_layers(p, geom_line(colour = "orange"), position = "bottom") append_layers(p, geom_line(colour = "orange"), position = 1L) extract_layers(p, "GeomPoint") ggplot(df, aes(gp, y)) + extract_layers(p, "GeomPoint") which_layers(p, "GeomPoint") num_layers(p) top_layer(p) bottom_layer(p) num_layers(ggplot()) top_layer(ggplot()) bottom_layer(ggplot()) if (requireNamespace("sf", quietly = TRUE)) { nc <- sf::st_read(system.file("shape/nc.shp", package = "sf"), quiet = TRUE) nc_3857 <- sf::st_transform(nc, 3857) p.sf1 <- ggplot() + geom_sf(data = nc) p.sf1 num_layers(p.sf1) top_layer(p.sf1) append_layers(p.sf1, geom_sf(data = nc_3857, colour = "red", fill = NA), position = "top") p.sf2 <- ggplot() + geom_sf(data = nc) + geom_sf(data = nc_3857, colour = "red", fill = NA) p.sf2 num_layers(p.sf2) top_layer(p.sf2) delete_layers(p.sf2, idx = 2L) extract_layers(p.sf2, "GeomSf") extract_layers(p.sf2, "StatSf") extract_layers(p.sf2, idx = 1L) p.sf1 + extract_layers(p.sf2, idx = 2L) # beware that Coords are not extracted! ggplot() + extract_layers(p.sf2, idx = 2L) + coord_sf() }
df <- data.frame( gp = factor(rep(letters[1:3], each = 10)), y = rnorm(30) ) p <- ggplot(df, aes(gp, y)) + geom_point() + stat_summary(fun.data = "mean_se", colour = "red") p delete_layers(p, "GeomPoint") delete_layers(p, "StatSummary") move_layers(p, "GeomPoint", position = "top") move_layers(p, "GeomPointrange", position = "bottom") move_layers(p, "StatSummary", position = "bottom") move_layers(p, "GeomPointrange", position = 1L) append_layers(p, geom_line(colour = "orange"), position = "bottom") append_layers(p, geom_line(colour = "orange"), position = 1L) extract_layers(p, "GeomPoint") ggplot(df, aes(gp, y)) + extract_layers(p, "GeomPoint") which_layers(p, "GeomPoint") num_layers(p) top_layer(p) bottom_layer(p) num_layers(ggplot()) top_layer(ggplot()) bottom_layer(ggplot()) if (requireNamespace("sf", quietly = TRUE)) { nc <- sf::st_read(system.file("shape/nc.shp", package = "sf"), quiet = TRUE) nc_3857 <- sf::st_transform(nc, 3857) p.sf1 <- ggplot() + geom_sf(data = nc) p.sf1 num_layers(p.sf1) top_layer(p.sf1) append_layers(p.sf1, geom_sf(data = nc_3857, colour = "red", fill = NA), position = "top") p.sf2 <- ggplot() + geom_sf(data = nc) + geom_sf(data = nc_3857, colour = "red", fill = NA) p.sf2 num_layers(p.sf2) top_layer(p.sf2) delete_layers(p.sf2, idx = 2L) extract_layers(p.sf2, "GeomSf") extract_layers(p.sf2, "StatSf") extract_layers(p.sf2, idx = 1L) p.sf1 + extract_layers(p.sf2, idx = 2L) # beware that Coords are not extracted! ggplot() + extract_layers(p.sf2, idx = 2L) + coord_sf() }
Automatically remove unused variables from the "default" data object embedded
in a gg
or ggplot
object with drop_vars()
. Explore data
variables and their use with mapped_vars()
, data_vars()
and
data_attributes()
.
drop_vars(p, keep.vars = character(), guess.vars = TRUE) mapped_vars(p, invert = FALSE) data_vars(p) data_attributes(p)
drop_vars(p, keep.vars = character(), guess.vars = TRUE) mapped_vars(p, invert = FALSE) data_vars(p) data_attributes(p)
p |
ggplot Plot object with embedded data. |
keep.vars |
character Names of unused variables to be kept. |
guess.vars |
logical Flag indicating whether to find used variables automatically. |
invert |
logical If TRUE return indices for elements of |
A "ggplot"
object that is a copy of p
but containing
only a subset of the variables in its default data
.
character vector with names of mapped variables in the default data object.
character vector with names of all variables in the default data object.
list containing all attributes of the default data object.
The current implementation drops variables only from the default data object. Data objects within layers are not modified.
These functions are under development and not yet thoroughly tested!
They are a demonstration of how one can manipulate the internals of
ggplot
objects creayed with 'ggplot2' versions 3.1.0 and later.
These functions may stop working after some future update to the 'ggplot2'
package. Although I will maintain this package for use in some of my other
packages, there is no guarantee that I will be able to achieve this
transparently.
Obviously, rather than using function drop_vars()
after creating the
ggplot
object it is usually more efficient to select the variables
of interest and pass a data frame containing only these to the
ggplot()
constructor.
library(ggplot2) p <- ggplot(mpg, aes(factor(year), (cty + hwy) / 2)) + geom_boxplot() + facet_grid(. ~ class) mapped_vars(p) # those in use mapped_vars(p, invert = TRUE) # those not used p.dp <- drop_vars(p) # we drop unused vars # number of columns in the data member ncol(p$data) ncol(p.dp$data) # which vars are in the data member data_vars(p) data_vars(p.dp) # which variables in data are used in the plot mapped_vars(p) mapped_vars(p.dp) # the plots identical p p.dp # structure and size of p str(p, max.level = 0) str(p.dp, max.level = 0) # smaller in size # structure and size of p["data"] str(p, components = "data") str(p.dp, components = "data") # smaller in size # shape data if (requireNamespace("sf", quietly = TRUE)) { nc <- sf::st_read(system.file("shape/nc.shp", package = "sf"), quiet = TRUE) p.sf <- ggplot(data = nc) + geom_sf() p.sf mapped_vars(p.sf) drop_vars(p.sf) }
library(ggplot2) p <- ggplot(mpg, aes(factor(year), (cty + hwy) / 2)) + geom_boxplot() + facet_grid(. ~ class) mapped_vars(p) # those in use mapped_vars(p, invert = TRUE) # those not used p.dp <- drop_vars(p) # we drop unused vars # number of columns in the data member ncol(p$data) ncol(p.dp$data) # which vars are in the data member data_vars(p) data_vars(p.dp) # which variables in data are used in the plot mapped_vars(p) mapped_vars(p.dp) # the plots identical p p.dp # structure and size of p str(p, max.level = 0) str(p.dp, max.level = 0) # smaller in size # structure and size of p["data"] str(p, components = "data") str(p.dp, components = "data") # smaller in size # shape data if (requireNamespace("sf", quietly = TRUE)) { nc <- sf::st_read(system.file("shape/nc.shp", package = "sf"), quiet = TRUE) p.sf <- ggplot(data = nc) + geom_sf() p.sf mapped_vars(p.sf) drop_vars(p.sf) }
The debug geoms are used to print to the console a summary of the object being
received by geoms' draw functions as input data
and parameters
objects.
geom_debug_panel( mapping = NULL, data = NULL, stat = "identity", dbgfun.data = "head", dbgfun.data.args = list(), dbgfun.params = "summary", dbgfun.params.args = list(), dbgfun.print = "print", dbgfun.print.args = list(), parse = NULL, orientation = NULL, nudge_x = 0, nudge_y = 0, position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... ) geom_debug( mapping = NULL, data = NULL, stat = "identity", summary.fun = "head", summary.fun.args = list(), parse = NULL, orientation = NULL, nudge_x = 0, nudge_y = 0, position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... ) geom_debug_group( mapping = NULL, data = NULL, stat = "identity", dbgfun.data = "head", dbgfun.data.args = list(), dbgfun.params = "summary", dbgfun.params.args = list(), dbgfun.print = "print", dbgfun.print.args = list(), parse = NULL, orientation = NULL, nudge_x = 0, nudge_y = 0, position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... )
geom_debug_panel( mapping = NULL, data = NULL, stat = "identity", dbgfun.data = "head", dbgfun.data.args = list(), dbgfun.params = "summary", dbgfun.params.args = list(), dbgfun.print = "print", dbgfun.print.args = list(), parse = NULL, orientation = NULL, nudge_x = 0, nudge_y = 0, position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... ) geom_debug( mapping = NULL, data = NULL, stat = "identity", summary.fun = "head", summary.fun.args = list(), parse = NULL, orientation = NULL, nudge_x = 0, nudge_y = 0, position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... ) geom_debug_group( mapping = NULL, data = NULL, stat = "identity", dbgfun.data = "head", dbgfun.data.args = list(), dbgfun.params = "summary", dbgfun.params.args = list(), dbgfun.print = "print", dbgfun.print.args = list(), parse = NULL, orientation = NULL, nudge_x = 0, nudge_y = 0, position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... )
mapping |
Set of aesthetic mappings created by
|
data |
A data frame. If specified, overrides the default data frame defined at the top level of the plot. |
stat |
The statistical transformation to use on the data for this layer, as a string. |
dbgfun.data , dbgfun.params , summary.fun
|
The functions as character
strings giving their names or as named or anonymous function objects, to be
used to summarize the |
dbgfun.data.args , dbgfun.params.args , summary.fun.args
|
A named list of
additional arguments to be passed to |
dbgfun.print |
A function used to print the |
dbgfun.print.args |
A named list. Currently ignored! |
parse , orientation
|
Ignored. Helps avoid warnings. |
nudge_x , nudge_y
|
Horizontal and vertical adjustments to nudge the
starting position. The units for |
position |
Position adjustment, either as a string, or the result of a call to a position adjustment function. |
na.rm |
If |
show.legend |
logical. Should this layer be included in the legends?
|
inherit.aes |
If |
... |
other arguments passed on to
|
It can be useful when debugging the code of statistics or to learn how the stats and geoms work in 'ggplot2' (>= 3.0.0).
The intended use of geom_debug_panel()
and
geom_debug_group()
is to explore the data as
they are used in a plot layer to produce graphical output. Geometries
can be defined using draw functions that receive as input data corresponding
to a single group at a time, or draw functions that receive as input all
data to be drawn in a panel at a time, possibly including multiple
groups. Function geom_debug()
is identical to
geom_debug_panel()
, and included for backwards compatibility.
These debug geoms are very unusual in that they do not produce
visible graphic output. They "draw" a grid.null()
grob (graphical
object) when the plot is rendered. Also, differently to normal geometries,
they print the data
and params
objects or a summary of them
to the R console. The summary is obtained using the functions passed as
arguments to their formal parameter dbgfun.data
and
dbgfun.params
. The data
and params
objects are passed
as the first positional argument to these functions and the values they
return are printed.
If dbgfun.data = "I"
is passed, the data
object is printed as
is. In contrast, if dbgfun.data = NULL
is passed, the data
object summary and its printing are not skipped. The mechanism is identical
for dbgfun.params
and params
.
Nudging with nudge_x
and nudge_y
behave as in
geom_text
. Arguments passed to position
are
obeyed. So the effects of positions are reflected in the data
object
printed or summarized to the R console. The arguments passed to
parse
and orientation
are currently ignored.
Many aesthetics are defined as optional so that they are accepted silently
by geom_debug()
and handled by 'ggplot2' as usual. Given the number
available extensions to 'ggplot2', it is likely that some are missing.
The names of dbgfun.data()
and dbgfun.params()
are included
in the section headers of the printout, together with panels and groups.
In most cases, the definitions of the debug and print functions must be
available when the "gg"
object is printed and the plot rendered.
The panel function of this geometry always returns a
nullGrob
, the legend is is also set to
nullGrob
. This geometry used for its text printing side
effect.
geom_debug()
is a synonym of geom_debug_panel()
, for
backwards compatibility. Not to be used in new code.
To access data, scales and grobs in a built ggplot, see
ggplot_build
.
# echo to the R console \code{data} as received by geoms ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel() ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_group() ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.params = NULL) ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = NULL) ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = "head", dbgfun.data.args = list(n = 3)) ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = "nrow", dbgfun.params = "length") ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = "attributes", dbgfun.params = "attributes") ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = "I", dbgfun.params = NULL) # echo to the R console \code{data} as received by geoms ggplot(mtcars, aes(cyl, mpg, colour = factor(cyl))) + stat_summary(fun.data = "mean_se") + stat_summary(geom = "debug_panel", fun.data = "mean_se") ggplot(mtcars, aes(cyl, mpg, colour = factor(cyl))) + stat_summary(fun.data = "mean_se") + stat_summary(geom = "debug_panel", fun.data = "mean_se", dbgfun.params = NULL) # shape data is not passed to geometries or statistics if (requireNamespace("sf", quietly = TRUE)) { nc <- sf::st_read(system.file("shape/nc.shp", package = "sf"), quiet = TRUE) ggplot(data = nc) + geom_sf(color = "darkblue", fill = "white") + geom_debug_panel() } # backwards compatibility ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug() ggplot(mtcars, aes(cyl, mpg, colour = factor(cyl))) + stat_summary(fun.data = "mean_se") + stat_summary(geom = "debug", fun.data = "mean_se")
# echo to the R console \code{data} as received by geoms ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel() ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_group() ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.params = NULL) ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = NULL) ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = "head", dbgfun.data.args = list(n = 3)) ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = "nrow", dbgfun.params = "length") ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = "attributes", dbgfun.params = "attributes") ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug_panel(dbgfun.data = "I", dbgfun.params = NULL) # echo to the R console \code{data} as received by geoms ggplot(mtcars, aes(cyl, mpg, colour = factor(cyl))) + stat_summary(fun.data = "mean_se") + stat_summary(geom = "debug_panel", fun.data = "mean_se") ggplot(mtcars, aes(cyl, mpg, colour = factor(cyl))) + stat_summary(fun.data = "mean_se") + stat_summary(geom = "debug_panel", fun.data = "mean_se", dbgfun.params = NULL) # shape data is not passed to geometries or statistics if (requireNamespace("sf", quietly = TRUE)) { nc <- sf::st_read(system.file("shape/nc.shp", package = "sf"), quiet = TRUE) ggplot(data = nc) + geom_sf(color = "darkblue", fill = "white") + geom_debug_panel() } # backwards compatibility ggplot(mtcars, aes(cyl, mpg, color = factor(cyl))) + geom_point() + geom_debug() ggplot(mtcars, aes(cyl, mpg, colour = factor(cyl))) + stat_summary(fun.data = "mean_se") + stat_summary(geom = "debug", fun.data = "mean_se")
The null geom can be used to silence graphic output from a stat, such as
stat_debug_group()
and stat_debug_panel()
defined in this same
package. No visible graphical output is returned. An invisible
grid::grid_null()
grob is returned instead.
geom_null( mapping = NULL, data = NULL, stat = "identity", position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... )
geom_null( mapping = NULL, data = NULL, stat = "identity", position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... )
mapping |
Set of aesthetic mappings created by
|
data |
A data frame. If specified, overrides the default data frame defined at the top level of the plot. |
stat |
The statistical transformation to use on the data for this layer, as a string. |
position |
Position adjustment, either as a string, or the result of a call to a position adjustment function. |
na.rm |
If |
show.legend |
logical. Should this layer be included in the legends?
|
inherit.aes |
If |
... |
other arguments passed on to
|
A plot layer instance. Mainly used for the side-effect of printing
to the console the data
object.
This geom is very unusual in that it does not produce visible graphic
output. It only returns a grid.null
grob (graphical
object). However, it accepts for consistency all the same parameters as
normal geoms, which have no effect on the graphical output, except for
show.legend
.
ggplot(mtcars) + geom_null() ggplot(mtcars, aes(cyl, mpg)) + geom_null() # shape data if (requireNamespace("sf", quietly = TRUE)) { nc <- sf::st_read(system.file("shape/nc.shp", package = "sf"), quiet = TRUE) ggplot(data = nc) + geom_null() }
ggplot(mtcars) + geom_null() ggplot(mtcars, aes(cyl, mpg)) + geom_null() # shape data if (requireNamespace("sf", quietly = TRUE)) { nc <- sf::st_read(system.file("shape/nc.shp", package = "sf"), quiet = TRUE) ggplot(data = nc) + geom_null() }
stat_debug_group
and stat_debug_panel
apply a function to data
and, most importantly,
echo to the R console their data
input or a summary of it.
stat_debug_group( mapping = NULL, data = NULL, geom = "null", fun.data = "I", fun.data.args = list(), dbgfun.data = "head", dbgfun.data.args = list(), geom.dbgfun.data = "head", geom.dbgfun.data.args = list(), geom.dbgfun.params = NULL, geom.dbgfun.params.args = list(), dbgfun.print = "print", dbgfun.print.args = list(), position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... ) stat_debug_panel( mapping = NULL, data = NULL, geom = "null", fun.data = "I", fun.data.args = list(), dbgfun.data = "head", dbgfun.data.args = list(), geom.dbgfun.data = "head", geom.dbgfun.data.args = list(), geom.dbgfun.params = NULL, geom.dbgfun.params.args = list(), dbgfun.print = "print", dbgfun.print.args = list(), position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... )
stat_debug_group( mapping = NULL, data = NULL, geom = "null", fun.data = "I", fun.data.args = list(), dbgfun.data = "head", dbgfun.data.args = list(), geom.dbgfun.data = "head", geom.dbgfun.data.args = list(), geom.dbgfun.params = NULL, geom.dbgfun.params.args = list(), dbgfun.print = "print", dbgfun.print.args = list(), position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... ) stat_debug_panel( mapping = NULL, data = NULL, geom = "null", fun.data = "I", fun.data.args = list(), dbgfun.data = "head", dbgfun.data.args = list(), geom.dbgfun.data = "head", geom.dbgfun.data.args = list(), geom.dbgfun.params = NULL, geom.dbgfun.params.args = list(), dbgfun.print = "print", dbgfun.print.args = list(), position = "identity", na.rm = FALSE, show.legend = FALSE, inherit.aes = TRUE, ... )
mapping |
The aesthetic mapping, usually constructed with
|
data |
A layer specific dataset - only needed if you want to override the plot defaults. |
geom |
The geometric object to use display the data |
fun.data |
A function taking a data frame as its first argument and returning a data frame. This function does the computations generating the value passed from the statistic to the downstream geometry. |
fun.data.args |
A named list of additional arguments to be passed to
|
dbgfun.data , geom.dbgfun.data , geom.dbgfun.params
|
A
functions used to summarise the |
dbgfun.data.args , geom.dbgfun.data.args , geom.dbgfun.params.args
|
A named list of arguments. |
dbgfun.print |
A function used to print the summary of the |
dbgfun.print.args |
A named list. Currently ignored! |
position |
The position adjustment to use for overlapping points on this layer |
na.rm |
a logical value indicating whether NA values should be stripped before the computation proceeds. |
show.legend |
logical. Should this layer be included in the legends?
|
inherit.aes |
If |
... |
other arguments passed on to |
These stats are meant to be used for the side-effect of printing to
the console the data
object received as input by the
compute_group()
or compute_panel()
function, or a summary of
it. These data
objects are the same as those received as input by
any other statistics passed the same arguments. By default, the applied
function is I()
, the identity function.
In principle any geom can be passed as argument to override geom =
"null"
. However, geom = "debug_panel"
and geom =
"debug_group"
are treated as special cases and functions
geom.dbgfun.data
and geom.dbgfun.params
, and lists
geom.dbgfun.data.args
and geom.dbgfun.params.args
renamed and
passed to the geometry. Arguments passed to these four formal parameters
are not passed to other geometries.
Keep in mind that this stat sets default mappings only for the x
and/or y aesthetics, additional mappings can be set using
aes()
, possibly together with after_stat()
.
A copy of its data
input, which is an object of class
"data.frame"
or inheriting from "data.frame"
.
x at centre of range
y at centre of range
nrow()
of data
object
ncol()
of data
object
colnames()
of data
object
class()
of x
and y
columns in
data
object
all distinct values in group as passed in
data
object
all distinct values in PANEL as passed in
data
object
my.df <- data.frame(x = rep(1:10, 2), y = rep(c(1,2), c(10,10)) + rnorm(20), group = rep(c("A","B"), c(10,10))) # by default head() is used to show the top rows of the data object # and geom_null() to silence the data returned by the stat ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group() # geom_debug prints the data returned by the stat ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(geom = "debug_panel") # geom_debug prints the data returned by the stat ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(geom = "debug_panel", geom.dbgfun.params = "summary") # to print only the the data returned by the stat # we pass as summary function a function that always returns NULL ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(geom = "debug_panel", dbgfun.data = function(x) {NULL}) ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(aes(label = paste("group:", group)), geom = "text") # here we show all the data object ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(dbgfun.data = "I") # with grouping ggplot(my.df, aes(x,y, colour = group)) + geom_point() + stat_debug_group() ggplot(my.df, aes(x,y, colour = group)) + geom_point() + stat_debug_panel() ggplot(my.df, aes(x, y, colour = group)) + geom_point() + stat_debug_group(dbgfun.data = "nrow") ggplot(my.df, aes(x, y)) + geom_point() + facet_wrap(~group) + stat_debug_group() # by default head() is used to show the top rows of data object ggplot(my.df, aes(group,y)) + geom_point() + stat_debug_group()
my.df <- data.frame(x = rep(1:10, 2), y = rep(c(1,2), c(10,10)) + rnorm(20), group = rep(c("A","B"), c(10,10))) # by default head() is used to show the top rows of the data object # and geom_null() to silence the data returned by the stat ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group() # geom_debug prints the data returned by the stat ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(geom = "debug_panel") # geom_debug prints the data returned by the stat ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(geom = "debug_panel", geom.dbgfun.params = "summary") # to print only the the data returned by the stat # we pass as summary function a function that always returns NULL ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(geom = "debug_panel", dbgfun.data = function(x) {NULL}) ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(aes(label = paste("group:", group)), geom = "text") # here we show all the data object ggplot(my.df, aes(x,y)) + geom_point() + stat_debug_group(dbgfun.data = "I") # with grouping ggplot(my.df, aes(x,y, colour = group)) + geom_point() + stat_debug_group() ggplot(my.df, aes(x,y, colour = group)) + geom_point() + stat_debug_panel() ggplot(my.df, aes(x, y, colour = group)) + geom_point() + stat_debug_group(dbgfun.data = "nrow") ggplot(my.df, aes(x, y)) + geom_point() + facet_wrap(~group) + stat_debug_group() # by default head() is used to show the top rows of data object ggplot(my.df, aes(group,y)) + geom_point() + stat_debug_group()
A str()
method tailored to objects of class "ggplot"
. It adds
to the output the size of the object, and the ability to subset
individual components.
## S3 method for class 'ggplot' str( object, ..., max.level = 1, components = TRUE, vec.len = 2, list.len = 99, give.attr = FALSE, comp.str = "$ ", nest.lev = 0, indent.str = paste(rep.int(" ", max(0, nest.lev + 1)), collapse = ".."), size = TRUE )
## S3 method for class 'ggplot' str( object, ..., max.level = 1, components = TRUE, vec.len = 2, list.len = 99, give.attr = FALSE, comp.str = "$ ", nest.lev = 0, indent.str = paste(rep.int(" ", max(0, nest.lev + 1)), collapse = ".."), size = TRUE )
object |
ggplot Plot object with embedded data. |
... |
accept additional parameter arguments |
max.level |
integer Maximum depth of recursion (of lists within lists ...) to be printed. |
components |
Vector of components to print, as indexes into
|
vec.len |
integer Approximate maximum length allowed when showing the first few values of a vector. |
list.len |
integer Maximum number of components to show of any list that will be described. |
give.attr |
logical Flag, determining whether a description of attributes will be shown. |
comp.str |
character String to be used for separating list components. |
nest.lev |
numeric current nesting level in the recursive calls to
|
indent.str |
character String used for each level of indentation. |
size |
logical Flag, should the size of the object in bytes be printed? |
A NULL is returned invisibly. While a description of the
structure of p
or its components will be printed in outline form as
a "side-effect", with indentation for each level of recursion, showing the
internal storage mode, class(es) if any, attributes, and first few
elements of each data vector. By default each level of list recursion is
indicated and attributes enclosed in angle brackets.
In the case of objects with a nested structure str()
is called
recursively and dispatched according to the class of each nested member.
A summary
method for class ggplot
is defined by
package 'ggplot2'. Method summary()
provides a more compact
description of "ggplot"
objects than method str()
. Here we
provide a wrapper on R's str()
with different default arguments. A
summary does not directly describe how the different components of an R
object are stored, while the structure does.
p <- ggplot(mpg, aes(factor(year), (cty + hwy) / 2)) + geom_boxplot() + geom_point(color = "red") + facet_grid(. ~ class) + ggtitle("Example plot") p # str(p) vs. summary(p) str(p) summary(p) # from pacakge 'ggplot2' # structure of p at 2 levels of nesting str(p, max.level = 2, size = FALSE) # top level structure and size of p str(p, max.level = 0) # names of ggplot members names(p) # structure and size of p["data"] str(p, max.level = 2, components = "data") # structure and size of p["layers"] str(p, max.level = 1, components = "layers")
p <- ggplot(mpg, aes(factor(year), (cty + hwy) / 2)) + geom_boxplot() + geom_point(color = "red") + facet_grid(. ~ class) + ggtitle("Example plot") p # str(p) vs. summary(p) str(p) summary(p) # from pacakge 'ggplot2' # structure of p at 2 levels of nesting str(p, max.level = 2, size = FALSE) # top level structure and size of p str(p, max.level = 0) # names of ggplot members names(p) # structure and size of p["data"] str(p, max.level = 2, components = "data") # structure and size of p["layers"] str(p, max.level = 1, components = "layers")