'Column reference data.table function R

I'm trying to make a function that calls to a column in the data table supplied as one of the arguments below:

df <- read.table(text = "x1 x2 y
CA 20 50
CA 30.5 100
CA 40.5 200
AZ 20.12 400
AZ 25 500
OR 86 600
OR 75 700
OR 45 800", header = TRUE)

df$x1 <- as.factor(df$x1)

library(data.table)

make_freq <- function(df, var_name){
  
  df <- df 
  setDT(df)
  
  
  tb <- df[, .N, by = var_name][,prop_ := round(((N/sum(N))*100), digits = 0)][order(var_name)]
  
  gg1 <- ggplot(tb, aes(x = var_name, y = prop_)) +
    geom_bar(width = .35, stat = "identity", color = "darkblue", fill = "darkblue") +
    ggtitle(paste0("var_name")) +
    theme_bw() +
    theme(plot.title = element_text(size = 10)) +
    theme(axis.text.x = element_text(angle = 45)) 
  
  return(list(figure = gg1))
}

make_freq(df = df, var_name = x1)

Ideally I want to be able to run the function so that I can create the ggplot figure for any categorical variable I want using the var_name argument. I'm getting Object x1 not found error which makes me think I need to quote or unquote the var_name argument within the function or something.



Solution 1:[1]

Yes, if you would like to use non-standard evaluation you will need to quote the var_name argument. Simply add:

var_name <- substitute(var_name)

to the top of the function. Note that the default x axis label in this case will be var_name. If you would like it to default to whatever is passed as var_name you will need to do a couple of extra steps. Change the top of the function to:

  x <- enquo(var_name)
  var_name <- substitute(var_name)

Then modify the tb line.

  tb <- df[, .N, by = eval(deparse(var_name))][,prop_ := round(((N/sum(N))*100), digits = 0)][order(eval(var_name))]

Then in ggplot():

gg1 <- ggplot(tb, aes(x = !!x, y = prop_)) + ...

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source
Solution 1 cazman