validata

R 패키지 메타데이터와 수집 신호를 모아 봅니다.

Packages / CRAN / validata

validata

v0.1.1
Repository CRANLicense MIT + file LICENSELifecycle activeNeeds compilation no
DOI
10.32614/CRAN.package.validata
Reverse imports
41

Core Signals

첫 화면에서 판단해야 할 수집 신호를 먼저 배치합니다.

1
Reverse imports
41

Supported Backends

DESCRIPTION에서 감지한 backend 관련 package입니다.

1
D
data.table
Imports

Quick Facts

기본 메타데이터를 작은 카드와 토큰으로 압축합니다.

profile
Repository
CRAN
Version
0.1.1
License
MIT + file LICENSE
Lifecycle
active
Needs compilation
no
Reverse imports
41
Last observed
2026-05-30
CRAN
cran.r-project.org/package=validata

수집 소스별 패키지 정보

1개 소스
CRAN
0.1.1
2026-05-30
License
MIT + file LICENSE
Depends
R (>= 2.10)
Imports
dplyr, stringr, janitor, rlang, tidyselect, purrr, magrittr, tidyr, tibble, gtools, listviewer, data.table, scales, utils, framecleaner, badger, rlist
Suggests
knitr, rmarkdown, testit
Needs compilation
no
Reverse imports
41
Lifecycle
active
Last observed
2026-05-30 10:45:11

이 패키지가 의존하는 패키지

5개 표시전체 20개
PackageTypeSpec
badger
CRAN · 0.1.1 · 2026-05-30
Importsbadger
data.table
CRAN · 0.1.1 · 2026-05-30
Importsdata.table
dplyr
CRAN · 0.1.1 · 2026-05-30
Importsdplyr
framecleaner
CRAN · 0.1.1 · 2026-05-30
Importsframecleaner
gtools
CRAN · 0.1.1 · 2026-05-30
Importsgtools
1 / 4

이 패키지를 쓰는 패키지

1개 표시전체 1개
PackageTypeSpec
TidyConsultant
0.1.2
CRAN · 2026-05-30
Importsvalidata
1 / 1

Reverse dependency summary

1 types
TypePackages
Imports1

패키지 페이지

Reverse imports
2
All links
45
Repository
CRAN
Version
0.1.1
Collected
2026-05-20 06:53:49
Package page
https://cran.r-project.org/web/packages/validata/index.html
DOI
10.32614/CRAN.package.validata
CRAN checks
https://cran.r-project.org/web/checks/check_results_validata.html
README
https://cran.r-project.org/web/packages/validata/readme/README.html
NEWS
https://cran.r-project.org/web/packages/validata/news/news.html
Reference HTML
https://cran.r-project.org/web/packages/validata/refman/validata.html
Reference PDF
https://cran.r-project.org/web/packages/validata/validata.pdf
Source package
https://cran.r-project.org/src/contrib/validata_0.1.1.tar.gz
Archive
https://CRAN.R-project.org/src/contrib/Archive/validata
Page fields
Author
Harrison Tietze [aut, cre]
BugReports
https://github.com/Harrison4192/validata/issues
CRAN Checks
validata results
DOI
10.32614/CRAN.package.validata
License
MIT + file LICENSE
Maintainer
Harrison Tietze <Harrison4192 at gmail.com>
Materials
README , NEWS
NeedsCompilation
no
Old Sources
validata archive
Package Source
validata_0.1.1.tar.gz
Published
2026-02-26
Reference Manual
validata.html , validata.pdf
Reverse Imports
TidyConsultant
URL
https://harrison4192.github.io/validata/ , https://github.com/Harrison4192/validata
Version
0.1.1
Vignettes
validata ( source , R code )
Windows Binaries
r-devel: validata_0.1.1.zip , r-release: validata_0.1.1.zip , r-oldrel: validata_0.1.1.zip
MacOS Binaries
r-release (arm64): validata_0.1.1.tgz , r-oldrel (arm64): validata_0.1.1.tgz , r-release (x86_64): validata_0.1.1.tgz , r-oldrel (x86_64): validata_0.1.1.tgz
Version
0.1.1
Published
2026-02-26
DOI
10.32614/CRAN.package.validata
Author
Harrison Tietze [aut, cre]
Maintainer
Harrison Tietze <Harrison4192 at gmail.com>
BugReports
https://github.com/Harrison4192/validata/issues
License
MIT + file LICENSE
URL
https://harrison4192.github.io/validata/ , https://github.com/Harrison4192/validata
NeedsCompilation
no
Materials
README , NEWS
CRAN Checks
validata results
Reference Manual
validata.html , validata.pdf
Vignettes
validata ( source , R code )
Package Source
validata_0.1.1.tar.gz
Windows Binaries
r-devel: validata_0.1.1.zip , r-release: validata_0.1.1.zip , r-oldrel: validata_0.1.1.zip
MacOS Binaries
r-release (arm64): validata_0.1.1.tgz , r-oldrel (arm64): validata_0.1.1.tgz , r-release (x86_64): validata_0.1.1.tgz , r-oldrel (x86_64): validata_0.1.1.tgz
Old Sources
validata archive
Reverse Imports
TidyConsultant
Page sections 4
Documentation
Heading
Documentation
Links
[{"label":"validata.html","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/refman/validata.html"},{"label":"validata.pdf","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/validata.pdf"},{"label":"validata","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/vignettes/validata.html"},{"label":"source","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/vignettes/validata.Rmd"},{"label":"R code","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/vignettes/validata.R"}]
Text
Reference manual: validata.html , validata.pdf Vignettes: validata ( source , R code )
Downloads
Heading
Downloads
Links
[{"label":"validata_0.1.1.tar.gz","section":"","type":"","url":"https://cran.r-project.org/src/contrib/validata_0.1.1.tar.gz"},{"label":"validata_0.1.1.zip","section":"","type":"","url":"https://cran.r-project.org/bin/windows/contrib/4.7/validata_0.1.1.zip"},{"label":"validata_0.1.1.zip","section":"","type":"","url":"https://cran.r-project.org/bin/windows/contrib/4.6/validata_0.1.1.zip"},{"label":"validata_0.1.1.zip","section":"","type":"","url":"https://cran.r-project.org/bin/windows/contrib/4.5/validata_0.1.1.zip"},{"label":"validata_0.1.1.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/sonoma-arm64/contrib/4.6/validata_0.1.1.tgz"},{"label":"validata_0.1.1.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/big-sur-arm64/contrib/4.5/validata_0.1.1.tgz"},{"label":"validata_0.1.1.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/big-sur-x86_64/contrib/4.6/validata_0.1.1.tgz"},{"label":"validata_0.1.1.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/big-sur-x86_64/contrib/4.5/validata_0.1.1.tgz"},{"label":"validata archive","section":"","type":"","url":"https://CRAN.R-project.org/src/contrib/Archive/validata"}]
Text
Package source: validata_0.1.1.tar.gz Windows binaries: r-devel: validata_0.1.1.zip , r-release: validata_0.1.1.zip , r-oldrel: validata_0.1.1.zip macOS binaries: r-release (arm64): validata_0.1.1.tgz , r-oldrel (arm64): validata_0.1.1.tgz , r-release (x86_64): validata_0.1.1.tgz , r-oldrel (x86_64): validata_0.1.1.tgz Old sources: validata archive
Reverse dependencies
Heading
Reverse dependencies
Links
[{"label":"TidyConsultant","section":"","type":"","url":"https://cran.r-project.org/web/packages/TidyConsultant/index.html"}]
Text
Reverse imports: TidyConsultant
Linking
Heading
Linking
Links
[{"label":"https://CRAN.R-project.org/package=validata","section":"","type":"","url":"https://CRAN.R-project.org/package=validata"}]
Text
Please use the canonical form https://CRAN.R-project.org/package=validata to link to this page.
Materials 2
Documentation 5
Vignettes 3
Downloads 9
All page links 45

패키지 문서 원문

4 artifacts
field
NEWS
CRAN · 0.1.1 · Materials · text/html · 859 · 2026-05-07
Title
NEWS
Label
NEWS
Text content
Text content
NEWS code{white-space: pre-wrap;} span.smallcaps{font-variant: small-caps;} span.underline{text-decoration: underline;} div.column{display: inline-block; vertical-align: top; width: 50%;} div.hanging-indent{margin-left: 1.5em; text-indent: -1.5em;} ul.task-list{list-style: none;} validata 0.1.1 validata 0.1.0 Added a NEWS.md file to track changes to the package.
field
README
CRAN · 0.1.1 · Materials · text/html · 6,109 · 2026-05-07
Title
README
Label
README
Text content
Text content
README code{white-space: pre-wrap;} span.smallcaps{font-variant: small-caps;} span.underline{text-decoration: underline;} div.column{display: inline-block; vertical-align: top; width: 50%;} div.hanging-indent{margin-left: 1.5em; text-indent: -1.5em;} ul.task-list{list-style: none;} pre > code.sourceCode { white-space: pre; position: relative; } pre > code.sourceCode > span { display: inline-block; line-height: 1.25; } pre > code.sourceCode > span:empty { height: 1.2em; } .sourceCode { overflow: visible; } code.sourceCode > span { color: inherit; text-decoration: inherit; } div.sourceCode { margin: 1em 0; } pre.sourceCode { margin: 0; } @media screen { div.sourceCode { overflow: auto; } } @media print { pre > code.sourceCode { white-space: pre-wrap; } pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; } } pre.numberSource code { counter-reset: source-line 0; } pre.numberSource code > span { position: relative; left: -4em; counter-increment: source-line; } pre.numberSource code > span > a:first-child::before { content: counter(source-line); position: relative; left: -1em; text-align: right; vertical-align: baseline; border: none; display: inline-block; -webkit-touch-callout: none; -webkit-user-select: none; -khtml-user-select: none; -moz-user-select: none; -ms-user-select: none; user-select: none; padding: 0 4px; width: 4em; color: #aaaaaa; } pre.numberSource { margin-left: 3em; border-left: 1px solid #aaaaaa; padding-left: 4px; } div.sourceCode { } @media screen { pre > code.sourceCode > span > a:first-child::before { text-decoration: underline; } } code span.al { color: #ff0000; font-weight: bold; } /* Alert */ code span.an { color: #60a0b0; font-weight: bold; font-style: italic; } /* Annotation */ code span.at { color: #7d9029; } /* Attribute */ code span.bn { color: #40a070; } /* BaseN */ code span.bu { color: #008000; } /* BuiltIn */ code span.cf { color: #007020; font-weight: bold; } /* ControlFlow */ code span.ch { color: #4070a0; } /* Char */ code span.cn { color: #880000; } /* Constant */ code span.co { color: #60a0b0; font-style: italic; } /* Comment */ code span.cv { color: #60a0b0; font-weight: bold; font-style: italic; } /* CommentVar */ code span.do { color: #ba2121; font-style: italic; } /* Documentation */ code span.dt { color: #902000; } /* DataType */ code span.dv { color: #40a070; } /* DecVal */ code span.er { color: #ff0000; font-weight: bold; } /* Error */ code span.ex { } /* Extension */ code span.fl { color: #40a070; } /* Float */ code span.fu { color: #06287e; } /* Function */ code span.im { color: #008000; font-weight: bold; } /* Import */ code span.in { color: #60a0b0; font-weight: bold; font-style: italic; } /* Information */ code span.kw { color: #007020; font-weight: bold; } /* Keyword */ code span.op { color: #666666; } /* Operator */ code span.ot { color: #007020; } /* Other */ code span.pp { color: #bc7a00; } /* Preprocessor */ code span.sc { color: #4070a0; } /* SpecialChar */ code span.ss { color: #bb6688; } /* SpecialString */ code span.st { color: #4070a0; } /* String */ code span.va { color: #19177c; } /* Variable */ code span.vs { color: #4070a0; } /* VerbatimString */ code span.wa { color: #60a0b0; font-weight: bold; font-style: italic; } /* Warning */ validata The goal of validata is to provide functions for validating the structure and properties of data frames. Installation You can install the released version of validata from CRAN with: install.packages ( "validata" ) And the development version from GitHub with: # install.packages("devtools") devtools :: install_github ( "Harrison4192/validata" )
reference_manual_html
Reference manual HTML
CRAN · 0.1.1 · Documentation · text/html · 23,048 · 2026-05-07
Title
Help for package validata
Label
Reference manual HTML
Text content
Text content
Help for package validata const macros = { "\\R": "\\textsf{R}", "\\mbox": "\\text", "\\code": "\\texttt"}; function processMathHTML() { var l = document.getElementsByClassName('reqn'); for (let e of l) { katex.render(e.textContent, e, { throwOnError: false, macros }); } return; } Package {validata} Contents validata-package %>% confirm_distinct confirm_mapping confirm_overlap confirm_overlap_internal confirm_strlen data_mode determine_distinct determine_mapping determine_overlap diagnose diagnose_category diagnose_missing diagnose_numeric mode_fn mode_pct n_dupes sample_data1 top_n_vals view_missing Title: Validate Data Frames Version: 0.1.1 Maintainer: Harrison Tietze <Harrison4192@gmail.com> Description: Functions for validating the structure and properties of data frames. Answers essential questions about a data set after initial import or modification. What are the unique or missing values? What columns form a primary key? What are the properties of the numeric or categorical columns? What kind of overlap or mapping exists between 2 columns? License: MIT + file LICENSE URL: https://harrison4192.github.io/validata/ , https://github.com/Harrison4192/validata BugReports: https://github.com/Harrison4192/validata/issues Encoding: UTF-8 LazyData: true RoxygenNote: 7.3.3 Imports: dplyr, stringr, janitor, rlang, tidyselect, purrr, magrittr, tidyr, tibble, gtools, listviewer, data.table, scales, utils, framecleaner, badger, rlist Suggests: knitr, rmarkdown, testit VignetteBuilder: knitr Depends: R (≥ 2.10) NeedsCompilation: no Packaged: 2026-02-25 14:06:29 UTC; harrisontietze Author: Harrison Tietze [aut, cre] Repository: CRAN Date/Publication: 2026-02-26 10:00:02 UTC validata: Validate Data Frames Description Functions for validating the structure and properties of data frames. Answers essential questions about a data set after initial import or modification. What are the unique or missing values? What columns form a primary key? What are the properties of the numeric or categorical columns? What kind of overlap or mapping exists between 2 columns? Author(s) Maintainer : Harrison Tietze Harrison4192@gmail.com See Also Useful links: https://harrison4192.github.io/validata/ https://github.com/Harrison4192/validata Report bugs at https://github.com/Harrison4192/validata/issues Pipe operator Description See magrittr:: %>% for details. Usage lhs %>% rhs Confirm Distinct Description Confirm whether the rows of a data frame can be uniquely identified by the keys in the selected columns. Also reports whether the dataframe has duplicates. If so, it is best to remove duplicates and re-run the function. Usage confirm_distinct(.data, ...) Arguments .data A dataframe ... (ID) columns Value a Logical value invisibly with description printed to console Examples iris %>% confirm_distinct(Species, Sepal.Width) Confirm structural mapping between 2 columns Description The mapping between elements of 2 columns can have 4 different relationships: one - one, one - many, many - one, many - many. This function returns a view of the mappings by row, and prints a summary to the console. Usage confirm_mapping(.data, col1, col2, view = T) Arguments .data a data frame col1 column 1 col2 column 2 view View results? Value A view of mappings. Also returns the view as a data frame invisibly. Examples iris %>% confirm_mapping(Species, Sepal.Width, view = FALSE) Confirm Overlap Description Prints a venn-diagram style summary of the unique value overlap between two columns and also invisibly returns a dataframe that can be assigned to a variable and queried with the overlap helpers. The helpers can return values that appeared only the first col, second col, or both cols. Usage confirm_overlap(vec1, vec2, return_tibble = F) co_find_only_in_1(co_output) co_find_only_in_2(co_output) co_find_in_both(co_output) Arguments vec1 vector 1 vec2 vector 2 return_tibble logical. If TRUE, returns a tibble. otherwise by default returns the database invisibly to be queried by helper functions. co_output dataframe output from confirm_overlap Value tibble. overlap summary or overlap table Examples confirm_overlap(iris$Sepal.Width, iris$Sepal.Length) -> iris_overlap iris_overlap iris_overlap %>% co_find_only_in_1() iris_overlap %>% co_find_only_in_2() iris_overlap %>% co_find_in_both() Confirm Overlap internal Description A venn style summary of the overlap in unique values of 2 vectors Usage confirm_overlap_internal(vec1, vec2) Arguments vec1 vector 1 vec2 vector 2 Value 1 row tibble Examples confirm_overlap(iris$Sepal.Width, iris$Sepal.Length) confirm string length Description returns a count table of string lengths for a character column. The helper function choose_strlen filters dataframe for rows containing specific string length for the specified column. Usage confirm_strlen(mdb, col) choose_strlen(cs_output, len) Arguments mdb dataframe col unquoted column cs_output dataframe. output from confirm_strlen len integer vector. Value prints a summary and returns a dataframe invisibly dataframe with original columns, filtered to the specific string length Examples iris %>% tibble::as_tibble() %>% confirm_strlen(Species) -> iris_cs_output iris_cs_output iris_cs_output %>% choose_strlen(6) data_mode Description data_mode Usage data_mode(x, prop = TRUE) Arguments x vector prop show frequency as ratio? default T Value named double of length 1 Automatically determine primary key Description Uses confirm_distinct in an iterative fashion to determine the primary keys. Usage determine_distinct(df, ..., listviewer = TRUE) Arguments df a data frame ... columns or a tidyselect specification. defaults to everything listviewer logical. defaults to TRUE to view output using the listviewer package Details The goal of this function is to automatically determine which columns uniquely identify the rows of a dataframe. The output is a printed description of the combination of columns that form unique identifiers at each level. At level 1, the function tests if individual columns are primary keys At level 2, the function tests n C 2 combinations of columns to see if they form primary keys. The final level is testing all columns at once. For completely unique columns, they are recorded in level 1, but then dropped from the data frame to facilitate the determination of multi-column primary keys. If the dataset contains duplicated rows, they are eliminated before proceeding. Value list Examples sample_data1 %>% head ## on level 1, each column is tested as a unique identifier. the VAL columns have no ## duplicates and hence qualify, even though they normally would be considered as IDs ## on level 3, combinations of 3 columns are tested. implying that ID_COL 1,2,3 form a unique key ## level 2 does not appear, implying that combinations of any 2 ID_COLs do not form a unique key sample_data1 %>% determine_distinct(listviewer = FALSE) Determine pairwise structural mappings Description Determine pairwise structural mappings Usage determine_mapping(df, ..., listviewer = TRUE) Arguments df a data frame ... columns or a tidyselect specification listviewer logical. defaults to TRUE to view output using the listviewer package Value description of mappings Examples iris %>% determine_mapping(listviewer = FALSE) Determine Overlap Description Uses confirm_overlap in a pairise fashion to see venn style comparison of unique values between the columns chosen by a tidyselect specification. Usage determine_overlap(db, ...) Arguments db a data frame ... tidyselect specification. Default being everything. Value tibble Examples iris %>% determine_overlap() diagnose Description Pipe in a dataframe to return a diagnosis of its missing and unique values for each columns. Default behavior is to diagnose all columns, but a subset can be specified in the dots with tidyselect. Usage diagnose(df, ...) Arguments df dataframe ... tidyselect Details this function is inspired by the excellent dlookr package. It takes a dataframe and returns a summary of uni
section
validata.pdf
CRAN · 0.1.1 · Documentation · application/pdf · 97,385 · 2026-05-07
Title
validata.pdf
Label
validata.pdf

Reference for validata (0.1.1)

21개 topic
%>%
Pipe operator
CRAN · 0.1.1 · validata/man/pipe.Rd · 2026-05-07

See magrittr::[magrittr:pipe]%>% for details.

Aliases
%>%
Keywords
internal
Usage
lhs %>% rhs
confirm_distinct
Confirm Distinct
CRAN · 0.1.1 · validata/man/confirm_distinct.Rd · 2026-05-07

Confirm whether the rows of a data frame can be uniquely identified by the keys in the selected columns. Also reports whether the dataframe has duplicates. If so, it is best to remove duplicates and re-run the function.

Aliases
confirm_distinct
Usage
confirm_distinct(.data, ...)
Arguments
.data
A dataframe
...
(ID) columns
Value
a Logical value invisibly with description printed to console
Examples
iris %>% confirm_distinct(Species, Sepal.Width)
confirm_mapping
Confirm structural mapping between 2 columns
CRAN · 0.1.1 · validata/man/confirm_mapping.Rd · 2026-05-07

The mapping between elements of 2 columns can have 4 different relationships: one - one, one - many, many - one, many - many. This function returns a view of the mappings by row, and prints a summary to the console.

Aliases
confirm_mapping
Usage
confirm_mapping(.data, col1, col2, view = T)
Arguments
.data
a data frame
col1
column 1
col2
column 2
view
View results?
Value
A view of mappings. Also returns the view as a data frame invisibly.
Examples
iris %>% confirm_mapping(Species, Sepal.Width, view = FALSE)
confirm_overlap
Confirm Overlap
CRAN · 0.1.1 · validata/man/confirm_overlap.Rd · 2026-05-07

Prints a venn-diagram style summary of the unique value overlap between two columns and also invisibly returns a dataframe that can be assigned to a variable and queried with the overlap helpers. The helpers can return values that appeared only the first col, second col, or both cols.

Aliases
confirm_overlapco_find_only_in_1co_find_only_in_2co_find_in_both
Usage
confirm_overlap(vec1, vec2, return_tibble = F) co_find_only_in_1(co_output) co_find_only_in_2(co_output) co_find_in_both(co_output)
Arguments
vec1
vector 1
vec2
vector 2
return_tibble
logical. If TRUE, returns a tibble. otherwise by default returns the database invisibly to be queried by helper functions.
co_output
dataframe output from confirm_overlap
Value
tibble. overlap summary or overlap table
Examples
confirm_overlap(iris$Sepal.Width, iris$Sepal.Length) -> iris_overlap iris_overlap iris_overlap %>% co_find_only_in_1() iris_overlap %>% co_find_only_in_2() iris_overlap %>% co_find_in_both()
confirm_overlap_internal
Confirm Overlap internal
CRAN · 0.1.1 · validata/man/confirm_overlap_internal.Rd · 2026-05-07

A venn style summary of the overlap in unique values of 2 vectors

Aliases
confirm_overlap_internal
Keywords
internal
Usage
confirm_overlap_internal(vec1, vec2)
Arguments
vec1
vector 1
vec2
vector 2
Value
1 row tibble
Examples
confirm_overlap(iris$Sepal.Width, iris$Sepal.Length)
confirm_strlen
confirm string length
CRAN · 0.1.1 · validata/man/confirm_strlen.Rd · 2026-05-07

returns a count table of string lengths for a character column. The helper function choose_strlen filters dataframe for rows containing specific string length for the specified column.

Aliases
confirm_strlenchoose_strlen
Usage
confirm_strlen(mdb, col) choose_strlen(cs_output, len)
Arguments
mdb
dataframe
col
unquoted column
cs_output
dataframe. output from confirm_strlen
len
integer vector.
Value
prints a summary and returns a dataframe invisibly dataframe with original columns, filtered to the specific string length
Examples
iris %>% tibble::as_tibble() %>% confirm_strlen(Species) -> iris_cs_output iris_cs_output iris_cs_output %>% choose_strlen(6)
data_mode
CRAN · 0.1.1 · validata/man/data_mode.Rd · 2026-05-07

data_mode

Aliases
data_mode
Keywords
internal
Usage
data_mode(x, prop = TRUE)
Arguments
x
vector
prop
show frequency as ratio? default T
Value
named double of length 1
determine_distinct
Automatically determine primary key
CRAN · 0.1.1 · validata/man/determine_distinct.Rd · 2026-05-07

Uses confirm_distinct in an iterative fashion to determine the primary keys.

Aliases
determine_distinct
Usage
determine_distinct(df, ..., listviewer = TRUE)
Arguments
df
a data frame
...
columns or a tidyselect specification. defaults to everything
listviewer
logical. defaults to TRUE to view output using the listviewer package
Details
The goal of this function is to automatically determine which columns uniquely identify the rows of a dataframe. The output is a printed description of the combination of columns that form unique identifiers at each level. At level 1, the function tests if individual columns are primary keys At level 2, the function tests n C 2 combinations of columns to see if they form primary keys. The final level is testing all columns at once. For completely unique columns, they are recorded in level 1, but then dropped from the data frame to facilitate the determination of multi-column primary keys. If the dataset contains duplicated rows, they are eliminated before proceeding.
Value
list
Examples
sample_data1 %>% head ## on level 1, each column is tested as a unique identifier. the VAL columns have no ## duplicates and hence qualify, even though they normally would be considered as IDs ## on level 3, combinations of 3 columns are tested. implying that ID_COL 1,2,3 form a unique key ## level 2 does not appear, implying that combinations of any 2 ID_COLs do not form a unique key sample_data1 %>% determine_distinct(listviewer = FALSE)
determine_mapping
Determine pairwise structural mappings
CRAN · 0.1.1 · validata/man/determine_mapping.Rd · 2026-05-07

Determine pairwise structural mappings

Aliases
determine_mapping
Usage
determine_mapping(df, ..., listviewer = TRUE)
Arguments
df
a data frame
...
columns or a tidyselect specification
listviewer
logical. defaults to TRUE to view output using the listviewer package
Value
description of mappings
Examples
iris %>% determine_mapping(listviewer = FALSE)
determine_overlap
Determine Overlap
CRAN · 0.1.1 · validata/man/determine_overlap.Rd · 2026-05-07

Uses confirm_overlap in a pairise fashion to see venn style comparison of unique values between the columns chosen by a tidyselect specification.

Aliases
determine_overlap
Usage
determine_overlap(db, ...)
Arguments
db
a data frame
...
tidyselect specification. Default being everything.
Value
tibble
Examples
iris %>% determine_overlap()
diagnose
CRAN · 0.1.1 · validata/man/diagnose.Rd · 2026-05-07

Pipe in a dataframe to return a diagnosis of its missing and unique values for each columns. Default behavior is to diagnose all columns, but a subset can be specified in the dots with tidyselect.

Aliases
diagnose
Usage
diagnose(df, ...)
Arguments
df
dataframe
...
tidyselect
Details
this function is inspired by the excellent https://choonghyunryu.github.io/dlookr/dlookr package. It takes a dataframe and returns a summary of unique and missing values of the columns.
Value
dataframe summary
Examples
diagnose(iris)
diagnose_category
diagnose category
CRAN · 0.1.1 · validata/man/diagnose_category.Rd · 2026-05-07

counts the distinct entries of categorical variables. The max_distinct argument limits the scope to categorical variables with a maximum number of unique entries, to prevent overflow.

Aliases
diagnose_category
Usage
diagnose_category(.data, ..., max_distinct = 5)
Arguments
.data
dataframe
...
tidyselect
max_distinct
integer
Value
dataframe
Examples
diagnose_category(iris)
diagnose_missing
CRAN · 0.1.1 · validata/man/diagnose_missing.Rd · 2026-05-07

faster than diagnose if emphasis is on diagnosing missing values. Also, only shows the columns with any missing values.

Aliases
diagnose_missing
Usage
diagnose_missing(df, ...)
Arguments
df
dataframe
...
optional tidyselect
Value
tibble summary
Examples
diagnose_missing(tibble::tibble(x = c(NA, 1)))
diagnose_numeric
CRAN · 0.1.1 · validata/man/diagnose_numeric.Rd · 2026-05-07

Inputs a dataframe and returns various summary statistics of the numeric columns. For example zeros returns the ratio of 0 values in that column. minus counts negative values and infs counts Inf values. Other rarer metrics are also returned that may be helpful for quick diagnosis or understanding of numeric data. mode returns the most common value in the column (chooses at random in case of tie) , and mode_ratio returns its frequency as a ratio of the total rows

Aliases
diagnose_numeric
Usage
diagnose_numeric(.data, ...)
Arguments
.data
dataframe
...
tidyselect. Default: all numeric columns
Value
dataframe
Examples
iris %>% diagnose_numeric() %>% print(width = Inf)
mode_fn
statistical mode
CRAN · 0.1.1 · validata/man/mode_fn.Rd · 2026-05-07

returns the mode of a vector

Aliases
mode_fn
Usage
mode_fn(x)
Arguments
x
a vector
Value
a unit vector
Examples
c("b", "b", letters) %>% mode_fn()
mode_pct
mode with %
CRAN · 0.1.1 · validata/man/mode_pct.Rd · 2026-05-07

returns the mode of a vector with what percent of the data is the mode

Aliases
mode_pct
Usage
mode_pct(x)
Arguments
x
a vector
Value
a unit character vector
Examples
c("b", "b", letters) %>% mode_pct()
n_dupes
CRAN · 0.1.1 · validata/man/n_dupes.Rd · 2026-05-07

n_dupes

Aliases
n_dupes
Usage
n_dupes(x)
Arguments
x
a df
Value
an integer; number of dupe rows
sample_data1
Sample Data
CRAN · 0.1.1 · data · validata/man/sample_data1.Rd · 2026-05-07

Sample Data

Aliases
sample_data1
Keywords
125datainternalrowssamplewith
Usage
sample_data1
Format
6 columns. 3 id and 3 values ID_COL14-5 distinct codes
top_n_vals
top n vals
CRAN · 0.1.1 · validata/man/top_n_vals.Rd · 2026-05-07

top n vals

Aliases
top_n_vals
Usage
top_n_vals(x, top_n = 3)
Arguments
x
vector
top_n
integer to specify top n modes
Value
character unit vector
Examples
tibble::tibble(x = 1:10 %>% c(10,10,10,5,5)) -> t1 t1 %>% top_n_vals()
validata-package
validata: Validate Data Frames
CRAN · 0.1.1 · package · validata/man/validata-package.Rd · 2026-05-07

Functions for validating the structure and properties of data frames. Answers essential questions about a data set after initial import or modification. What are the unique or missing values? What columns form a primary key? What are the properties of the numeric or categorical columns? What kind of overlap or mapping exists between 2 columns?

Aliases
validatavalidata-package
Keywords
internal
See also
Useful links: https://harrison4192.github.io/validata/ https://github.com/Harrison4192/validata Report bugs at https://github.com/Harrison4192/validata/issues
Author
Maintainer: Harrison Tietze Harrison4192@gmail.com
view_missing
CRAN · 0.1.1 · validata/man/view_missing.Rd · 2026-05-07

View rows of the dataframe where columns in the tidyselect specification contain missings by default, detects missings in any column. The result is by default displayed in the viewer pane. Can be returned as a tibble optionally.

Aliases
view_missing
Usage
view_missing(df, ..., view = TRUE)
Arguments
df
dataframe
...
tidyselect
view
logical. if false, returns tibble
Value
tibble
Examples
view_missing(tibble::tibble(x = c(NA, 1)), view = FALSE)

버전 이력

RepositoryVersionPublishedFirst seenLast seenDocs
CRAN0.1.12026-05-282026-05-30

보안

표시할 OSV 데이터가 없습니다.

문헌 신호

표시할 OpenAlex 데이터가 없습니다.