Feature request: `is_dupes()` #485

brunomioto · 2022-06-29T17:09:42Z

Feature requests

The get_dupes() function returns a data.frame with the duplicated names.
The new is_dupes() would return a TRUE/FALSE when there's another name/value duplicated on the dataset.

data.frame(
  x = c("Brazil", "US", "Brazil", "China", "UK"),
  is_dupes = c(TRUE, FALSE, TRUE, FALSE, FALSE)
)
#>        x is_dupes
#> 1 Brazil     TRUE
#> 2     US    FALSE
#> 3 Brazil     TRUE
#> 4  China    FALSE
#> 5     UK    FALSE

^{Created on 2022-06-29 by the reprex package (v2.0.1)}

The text was updated successfully, but these errors were encountered:

sfirke · 2023-01-30T21:37:31Z

I think these two lines do it:

mtcars %>%
  dplyr::add_count(mpg, name = "is_dupe") %>%
  dplyr::mutate(as.logical(is_dupe-1))

A janitor function would likely just be wrapping that. Given that it's just two lines in dplyr, and the limited capacity for developing janitor, I'm going to close this as unplanned. And I apologize for taking so long to acknowledge this suggestion! 😔

sfirke closed this as not planned Won't fix, can't repro, duplicate, stale Jan 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: `is_dupes()` #485

Feature request: `is_dupes()` #485

brunomioto commented Jun 29, 2022

sfirke commented Jan 30, 2023

Feature request: is_dupes() #485

Feature request: is_dupes() #485

Comments

brunomioto commented Jun 29, 2022

Feature requests

sfirke commented Jan 30, 2023

Feature request: `is_dupes()` #485

Feature request: `is_dupes()` #485