validata

R 패키지 메타데이터와 수집 신호를 모아 봅니다.

Packages / CRAN / validata

validata

v0.1.1

validata

Repository CRANLicense MIT + file LICENSELifecycle activeNeeds compilation no

Package page Reference Source DOI README NEWS

DOI

10.32614/CRAN.package.validata

Reverse imports

100

Core Signals

첫 화면에서 판단해야 할 수집 신호를 먼저 배치합니다.

Reverse imports

100

Supported Backends

DESCRIPTION에서 감지한 backend 관련 package입니다.

data.table

Imports

Quick Facts

기본 메타데이터를 작은 카드와 토큰으로 압축합니다.

profile

Repository

CRAN

Version

0.1.1

License

MIT + file LICENSE

Lifecycle

active

Needs compilation

Reverse imports

100

Last observed

2026-07-10

CRAN: cran.r-project.org/package=validata

수집 소스별 패키지 정보

1개 소스

CRAN

0.1.1

2026-07-10

License

MIT + file LICENSE

Depends

R (>= 2.10)

Imports

dplyr, stringr, janitor, rlang, tidyselect, purrr, magrittr, tidyr, tibble, gtools, listviewer, data.table, scales, utils, framecleaner, badger, rlist

Suggests

knitr, rmarkdown, testit

Needs compilation

Reverse imports

100

Lifecycle

active

Last observed

2026-07-10 02:43:42

이 패키지가 의존하는 패키지

5개 표시전체 20개

Package	Type	Spec
badger CRAN · 0.1.1 · 2026-07-24	Imports	badger
data.table CRAN · 0.1.1 · 2026-07-24	Imports	data.table
dplyr CRAN · 0.1.1 · 2026-07-24	Imports	dplyr
framecleaner CRAN · 0.1.1 · 2026-07-24	Imports	framecleaner
gtools CRAN · 0.1.1 · 2026-07-24	Imports	gtools
janitor CRAN · 0.1.1 · 2026-07-24	Imports	janitor
listviewer CRAN · 0.1.1 · 2026-07-24	Imports	listviewer
magrittr CRAN · 0.1.1 · 2026-07-24	Imports	magrittr
purrr CRAN · 0.1.1 · 2026-07-24	Imports	purrr
rlang CRAN · 0.1.1 · 2026-07-24	Imports	rlang
rlist CRAN · 0.1.1 · 2026-07-24	Imports	rlist
scales CRAN · 0.1.1 · 2026-07-24	Imports	scales
stringr CRAN · 0.1.1 · 2026-07-24	Imports	stringr
tibble CRAN · 0.1.1 · 2026-07-24	Imports	tibble
tidyr CRAN · 0.1.1 · 2026-07-24	Imports	tidyr
tidyselect CRAN · 0.1.1 · 2026-07-24	Imports	tidyselect
utils CRAN · 0.1.1 · 2026-07-24	Imports	utils
knitr CRAN · 0.1.1 · 2026-07-24	Suggests	knitr
rmarkdown CRAN · 0.1.1 · 2026-07-24	Suggests	rmarkdown
testit CRAN · 0.1.1 · 2026-07-24	Suggests	testit
검색 결과가 없습니다.

1 / 4

이 패키지를 쓰는 패키지

1개 표시전체 1개

Package	Type	Spec
TidyConsultant 0.1.2 CRAN · 2026-07-24	Imports	validata
검색 결과가 없습니다.

1 / 1

Reverse dependency summary

1 types

Type	Packages
Imports	1

패키지 페이지

Reverse imports

All links

Repository: CRAN
Version: 0.1.1
Collected: 2026-05-20 06:53:49
Package page: https://cran.r-project.org/web/packages/validata/index.html
DOI: 10.32614/CRAN.package.validata
CRAN checks: https://cran.r-project.org/web/checks/check_results_validata.html
README: https://cran.r-project.org/web/packages/validata/readme/README.html
NEWS: https://cran.r-project.org/web/packages/validata/news/news.html
Reference HTML: https://cran.r-project.org/web/packages/validata/refman/validata.html
Reference PDF: https://cran.r-project.org/web/packages/validata/validata.pdf
Source package: https://cran.r-project.org/src/contrib/validata_0.1.1.tar.gz
Archive: https://CRAN.R-project.org/src/contrib/Archive/validata

Page fields

Author: Harrison Tietze [aut, cre]
BugReports: https://github.com/Harrison4192/validata/issues
CRAN Checks: validata results
DOI: 10.32614/CRAN.package.validata
License: MIT + file LICENSE
Maintainer: Harrison Tietze <Harrison4192 at gmail.com>
Materials: README , NEWS
NeedsCompilation: no
Old Sources: validata archive
Package Source: validata_0.1.1.tar.gz
Published: 2026-02-26
Reference Manual: validata.html , validata.pdf
Reverse Imports: TidyConsultant
URL: https://harrison4192.github.io/validata/ , https://github.com/Harrison4192/validata
Version: 0.1.1
Vignettes: validata ( source , R code )
Windows Binaries: r-devel: validata_0.1.1.zip , r-release: validata_0.1.1.zip , r-oldrel: validata_0.1.1.zip
MacOS Binaries: r-release (arm64): validata_0.1.1.tgz , r-oldrel (arm64): validata_0.1.1.tgz , r-release (x86_64): validata_0.1.1.tgz , r-oldrel (x86_64): validata_0.1.1.tgz

Version

0.1.1

Published

2026-02-26

DOI

10.32614/CRAN.package.validata

Author

Harrison Tietze [aut, cre]

Maintainer

Harrison Tietze <Harrison4192 at gmail.com>

BugReports

https://github.com/Harrison4192/validata/issues

License

MIT + file LICENSE

MIT LICENSE

URL

https://harrison4192.github.io/validata/ , https://github.com/Harrison4192/validata

https://harrison4192.github.io/validata/https://github.com/Harrison4192/validata

NeedsCompilation

Materials

README , NEWS

README NEWS

CRAN Checks

validata results

Reference Manual

validata.html , validata.pdf

validata.html validata.pdf

Vignettes

validata ( source , R code )

validata source R code

Package Source

validata_0.1.1.tar.gz

Windows Binaries

r-devel: validata_0.1.1.zip , r-release: validata_0.1.1.zip , r-oldrel: validata_0.1.1.zip

validata_0.1.1.zip validata_0.1.1.zip validata_0.1.1.zip

MacOS Binaries

r-release (arm64): validata_0.1.1.tgz , r-oldrel (arm64): validata_0.1.1.tgz , r-release (x86_64): validata_0.1.1.tgz , r-oldrel (x86_64): validata_0.1.1.tgz

validata_0.1.1.tgz validata_0.1.1.tgz validata_0.1.1.tgz validata_0.1.1.tgz

Old Sources

validata archive

Reverse Imports

TidyConsultant

Page sections 4

Documentation

Heading

Documentation

Links

[{"label":"validata.html","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/refman/validata.html"},{"label":"validata.pdf","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/validata.pdf"},{"label":"validata","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/vignettes/validata.html"},{"label":"source","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/vignettes/validata.Rmd"},{"label":"R code","section":"","type":"","url":"https://cran.r-project.org/web/packages/validata/vignettes/validata.R"}]

Text

Reference manual: validata.html , validata.pdf Vignettes: validata ( source , R code )

Downloads

Heading

Downloads

Links

[{"label":"validata_0.1.1.tar.gz","section":"","type":"","url":"https://cran.r-project.org/src/contrib/validata_0.1.1.tar.gz"},{"label":"validata_0.1.1.zip","section":"","type":"","url":"https://cran.r-project.org/bin/windows/contrib/4.7/validata_0.1.1.zip"},{"label":"validata_0.1.1.zip","section":"","type":"","url":"https://cran.r-project.org/bin/windows/contrib/4.6/validata_0.1.1.zip"},{"label":"validata_0.1.1.zip","section":"","type":"","url":"https://cran.r-project.org/bin/windows/contrib/4.5/validata_0.1.1.zip"},{"label":"validata_0.1.1.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/sonoma-arm64/contrib/4.6/validata_0.1.1.tgz"},{"label":"validata_0.1.1.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/big-sur-arm64/contrib/4.5/validata_0.1.1.tgz"},{"label":"validata_0.1.1.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/big-sur-x86_64/contrib/4.6/validata_0.1.1.tgz"},{"label":"validata_0.1.1.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/big-sur-x86_64/contrib/4.5/validata_0.1.1.tgz"},{"label":"validata archive","section":"","type":"","url":"https://CRAN.R-project.org/src/contrib/Archive/validata"}]

Text

Package source: validata_0.1.1.tar.gz Windows binaries: r-devel: validata_0.1.1.zip , r-release: validata_0.1.1.zip , r-oldrel: validata_0.1.1.zip macOS binaries: r-release (arm64): validata_0.1.1.tgz , r-oldrel (arm64): validata_0.1.1.tgz , r-release (x86_64): validata_0.1.1.tgz , r-oldrel (x86_64): validata_0.1.1.tgz Old sources: validata archive

Reverse dependencies

Heading

Reverse dependencies

Links

[{"label":"TidyConsultant","section":"","type":"","url":"https://cran.r-project.org/web/packages/TidyConsultant/index.html"}]

Text

Reverse imports: TidyConsultant

Linking

Heading

Linking

Links

[{"label":"https://CRAN.R-project.org/package=validata","section":"","type":"","url":"https://CRAN.R-project.org/package=validata"}]

Text

Please use the canonical form https://CRAN.R-project.org/package=validata to link to this page.

Materials 2

README NEWS

Documentation 5

validata.html validata.pdf validata source R code

Vignettes 3

validata source R code

Downloads 9

validata_0.1.1.tar.gz validata_0.1.1.zip validata_0.1.1.zip validata_0.1.1.zip validata_0.1.1.tgz validata_0.1.1.tgz validata_0.1.1.tgz validata_0.1.1.tgz validata archive

All page links 45

패키지 문서 원문

4 artifacts

field

NEWS

CRAN · 0.1.1 · Materials · text/html · 859 · 2026-05-07

Title

NEWS

Label

NEWS

Text content

NEWS code{white-space: pre-wrap;} span.smallcaps{font-variant: small-caps;} span.underline{text-decoration: underline;} div.column{display: inline-block; vertical-align: top; width: 50%;} div.hanging-indent{margin-left: 1.5em; text-indent: -1.5em;} ul.task-list{list-style: none;} validata 0.1.1 validata 0.1.0 Added a NEWS.md file to track changes to the package.

field

README

CRAN · 0.1.1 · Materials · text/html · 6,109 · 2026-05-07

Title

README

Label

README

Text content

README code{white-space: pre-wrap;} span.smallcaps{font-variant: small-caps;} span.underline{text-decoration: underline;} div.column{display: inline-block; vertical-align: top; width: 50%;} div.hanging-indent{margin-left: 1.5em; text-indent: -1.5em;} ul.task-list{list-style: none;} pre > code.sourceCode { white-space: pre; position: relative; } pre > code.sourceCode > span { display: inline-block; line-height: 1.25; } pre > code.sourceCode > span:empty { height: 1.2em; } .sourceCode { overflow: visible; } code.sourceCode > span { color: inherit; text-decoration: inherit; } div.sourceCode { margin: 1em 0; } pre.sourceCode { margin: 0; } @media screen { div.sourceCode { overflow: auto; } } @media print { pre > code.sourceCode { white-space: pre-wrap; } pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; } } pre.numberSource code { counter-reset: source-line 0; } pre.numberSource code > span { position: relative; left: -4em; counter-increment: source-line; } pre.numberSource code > span > a:first-child::before { content: counter(source-line); position: relative; left: -1em; text-align: right; vertical-align: baseline; border: none; display: inline-block; -webkit-touch-callout: none; -webkit-user-select: none; -khtml-user-select: none; -moz-user-select: none; -ms-user-select: none; user-select: none; padding: 0 4px; width: 4em; color: #aaaaaa; } pre.numberSource { margin-left: 3em; border-left: 1px solid #aaaaaa; padding-left: 4px; } div.sourceCode { } @media screen { pre > code.sourceCode > span > a:first-child::before { text-decoration: underline; } } code span.al { color: #ff0000; font-weight: bold; } /* Alert */ code span.an { color: #60a0b0; font-weight: bold; font-style: italic; } /* Annotation */ code span.at { color: #7d9029; } /* Attribute */ code span.bn { color: #40a070; } /* BaseN */ code span.bu { color: #008000; } /* BuiltIn */ code span.cf { color: #007020; font-weight: bold; } /* ControlFlow */ code span.ch { color: #4070a0; } /* Char */ code span.cn { color: #880000; } /* Constant */ code span.co { color: #60a0b0; font-style: italic; } /* Comment */ code span.cv { color: #60a0b0; font-weight: bold; font-style: italic; } /* CommentVar */ code span.do { color: #ba2121; font-style: italic; } /* Documentation */ code span.dt { color: #902000; } /* DataType */ code span.dv { color: #40a070; } /* DecVal */ code span.er { color: #ff0000; font-weight: bold; } /* Error */ code span.ex { } /* Extension */ code span.fl { color: #40a070; } /* Float */ code span.fu { color: #06287e; } /* Function */ code span.im { color: #008000; font-weight: bold; } /* Import */ code span.in { color: #60a0b0; font-weight: bold; font-style: italic; } /* Information */ code span.kw { color: #007020; font-weight: bold; } /* Keyword */ code span.op { color: #666666; } /* Operator */ code span.ot { color: #007020; } /* Other */ code span.pp { color: #bc7a00; } /* Preprocessor */ code span.sc { color: #4070a0; } /* SpecialChar */ code span.ss { color: #bb6688; } /* SpecialString */ code span.st { color: #4070a0; } /* String */ code span.va { color: #19177c; } /* Variable */ code span.vs { color: #4070a0; } /* VerbatimString */ code span.wa { color: #60a0b0; font-weight: bold; font-style: italic; } /* Warning */ validata The goal of validata is to provide functions for validating the structure and properties of data frames. Installation You can install the released version of validata from CRAN with: install.packages ( "validata" ) And the development version from GitHub with: # install.packages("devtools") devtools :: install_github ( "Harrison4192/validata" )

reference_manual_html

Reference manual HTML

CRAN · 0.1.1 · Documentation · text/html · 23,048 · 2026-05-07

Title

Help for package validata

Label

Reference manual HTML

Text content

Help for package validata const macros = { "\\R": "\\textsf{R}", "\\mbox": "\\text", "\\code": "\\texttt"}; function processMathHTML() { var l = document.getElementsByClassName('reqn'); for (let e of l) { katex.render(e.textContent, e, { throwOnError: false, macros }); } return; } Package {validata} Contents validata-package %>% confirm_distinct confirm_mapping confirm_overlap confirm_overlap_internal confirm_strlen data_mode determine_distinct determine_mapping determine_overlap diagnose diagnose_category diagnose_missing diagnose_numeric mode_fn mode_pct n_dupes sample_data1 top_n_vals view_missing Title: Validate Data Frames Version: 0.1.1 Maintainer: Harrison Tietze <Harrison4192@gmail.com> Description: Functions for validating the structure and properties of data frames. Answers essential questions about a data set after initial import or modification. What are the unique or missing values? What columns form a primary key? What are the properties of the numeric or categorical columns? What kind of overlap or mapping exists between 2 columns? License: MIT + file LICENSE URL: https://harrison4192.github.io/validata/ , https://github.com/Harrison4192/validata BugReports: https://github.com/Harrison4192/validata/issues Encoding: UTF-8 LazyData: true RoxygenNote: 7.3.3 Imports: dplyr, stringr, janitor, rlang, tidyselect, purrr, magrittr, tidyr, tibble, gtools, listviewer, data.table, scales, utils, framecleaner, badger, rlist Suggests: knitr, rmarkdown, testit VignetteBuilder: knitr Depends: R (≥ 2.10) NeedsCompilation: no Packaged: 2026-02-25 14:06:29 UTC; harrisontietze Author: Harrison Tietze [aut, cre] Repository: CRAN Date/Publication: 2026-02-26 10:00:02 UTC validata: Validate Data Frames Description Functions for validating the structure and properties of data frames. Answers essential questions about a data set after initial import or modification. What are the unique or missing values? What columns form a primary key? What are the properties of the numeric or categorical columns? What kind of overlap or mapping exists between 2 columns? Author(s) Maintainer : Harrison Tietze Harrison4192@gmail.com See Also Useful links: https://harrison4192.github.io/validata/ https://github.com/Harrison4192/validata Report bugs at https://github.com/Harrison4192/validata/issues Pipe operator Description See magrittr:: %>% for details. Usage lhs %>% rhs Confirm Distinct Description Confirm whether the rows of a data frame can be uniquely identified by the keys in the selected columns. Also reports whether the dataframe has duplicates. If so, it is best to remove duplicates and re-run the function. Usage confirm_distinct(.data, ...) Arguments .data A dataframe ... (ID) columns Value a Logical value invisibly with description printed to console Examples iris %>% confirm_distinct(Species, Sepal.Width) Confirm structural mapping between 2 columns Description The mapping between elements of 2 columns can have 4 different relationships: one - one, one - many, many - one, many - many. This function returns a view of the mappings by row, and prints a summary to the console. Usage confirm_mapping(.data, col1, col2, view = T) Arguments .data a data frame col1 column 1 col2 column 2 view View results? Value A view of mappings. Also returns the view as a data frame invisibly. Examples iris %>% confirm_mapping(Species, Sepal.Width, view = FALSE) Confirm Overlap Description Prints a venn-diagram style summary of the unique value overlap between two columns and also invisibly returns a dataframe that can be assigned to a variable and queried with the overlap helpers. The helpers can return values that appeared only the first col, second col, or both cols. Usage confirm_overlap(vec1, vec2, return_tibble = F) co_find_only_in_1(co_output) co_find_only_in_2(co_output) co_find_in_both(co_output) Arguments vec1 vector 1 vec2 vector 2 return_tibble logical. If TRUE, returns a tibble. otherwise by default returns the database invisibly to be queried by helper functions. co_output dataframe output from confirm_overlap Value tibble. overlap summary or overlap table Examples confirm_overlap(iris$Sepal.Width, iris$Sepal.Length) -> iris_overlap iris_overlap iris_overlap %>% co_find_only_in_1() iris_overlap %>% co_find_only_in_2() iris_overlap %>% co_find_in_both() Confirm Overlap internal Description A venn style summary of the overlap in unique values of 2 vectors Usage confirm_overlap_internal(vec1, vec2) Arguments vec1 vector 1 vec2 vector 2 Value 1 row tibble Examples confirm_overlap(iris$Sepal.Width, iris$Sepal.Length) confirm string length Description returns a count table of string lengths for a character column. The helper function choose_strlen filters dataframe for rows containing specific string length for the specified column. Usage confirm_strlen(mdb, col) choose_strlen(cs_output, len) Arguments mdb dataframe col unquoted column cs_output dataframe. output from confirm_strlen len integer vector. Value prints a summary and returns a dataframe invisibly dataframe with original columns, filtered to the specific string length Examples iris %>% tibble::as_tibble() %>% confirm_strlen(Species) -> iris_cs_output iris_cs_output iris_cs_output %>% choose_strlen(6) data_mode Description data_mode Usage data_mode(x, prop = TRUE) Arguments x vector prop show frequency as ratio? default T Value named double of length 1 Automatically determine primary key Description Uses confirm_distinct in an iterative fashion to determine the primary keys. Usage determine_distinct(df, ..., listviewer = TRUE) Arguments df a data frame ... columns or a tidyselect specification. defaults to everything listviewer logical. defaults to TRUE to view output using the listviewer package Details The goal of this function is to automatically determine which columns uniquely identify the rows of a dataframe. The output is a printed description of the combination of columns that form unique identifiers at each level. At level 1, the function tests if individual columns are primary keys At level 2, the function tests n C 2 combinations of columns to see if they form primary keys. The final level is testing all columns at once. For completely unique columns, they are recorded in level 1, but then dropped from the data frame to facilitate the determination of multi-column primary keys. If the dataset contains duplicated rows, they are eliminated before proceeding. Value list Examples sample_data1 %>% head ## on level 1, each column is tested as a unique identifier. the VAL columns have no ## duplicates and hence qualify, even though they normally would be considered as IDs ## on level 3, combinations of 3 columns are tested. implying that ID_COL 1,2,3 form a unique key ## level 2 does not appear, implying that combinations of any 2 ID_COLs do not form a unique key sample_data1 %>% determine_distinct(listviewer = FALSE) Determine pairwise structural mappings Description Determine pairwise structural mappings Usage determine_mapping(df, ..., listviewer = TRUE) Arguments df a data frame ... columns or a tidyselect specification listviewer logical. defaults to TRUE to view output using the listviewer package Value description of mappings Examples iris %>% determine_mapping(listviewer = FALSE) Determine Overlap Description Uses confirm_overlap in a pairise fashion to see venn style comparison of unique values between the columns chosen by a tidyselect specification. Usage determine_overlap(db, ...) Arguments db a data frame ... tidyselect specification. Default being everything. Value tibble Examples iris %>% determine_overlap() diagnose Description Pipe in a dataframe to return a diagnosis of its missing and unique values for each columns. Default behavior is to diagnose all columns, but a subset can be specified in the dots with tidyselect. Usage diagnose(df, ...) Arguments df dataframe ... tidyselect Details this function is inspired by the excellent dlookr package. It takes a dataframe and returns a summary of uni

section

validata.pdf

CRAN · 0.1.1 · Documentation · application/pdf · 97,385 · 2026-05-07

Title

validata.pdf

Label

validata.pdf

Reference for validata (0.1.1)

21개 topic

%>%

Pipe operator

CRAN · 0.1.1 · validata/man/pipe.Rd · 2026-05-07

See magrittr::[magrittr:pipe]%>% for details.

Aliases

%>%

Keywords

internal

Usage

lhs %>% rhs

confirm_distinct

Confirm Distinct

CRAN · 0.1.1 · validata/man/confirm_distinct.Rd · 2026-05-07

Confirm whether the rows of a data frame can be uniquely identified by the keys in the selected columns. Also reports whether the dataframe has duplicates. If so, it is best to remove duplicates and re-run the function.

Aliases

confirm_distinct

Usage

confirm_distinct(.data, ...)

Arguments

.data

A dataframe

...

(ID) columns

Value

a Logical value invisibly with description printed to console

Examples

iris %>% confirm_distinct(Species, Sepal.Width)

confirm_mapping

Confirm structural mapping between 2 columns

CRAN · 0.1.1 · validata/man/confirm_mapping.Rd · 2026-05-07

The mapping between elements of 2 columns can have 4 different relationships: one - one, one - many, many - one, many - many. This function returns a view of the mappings by row, and prints a summary to the console.

Aliases

confirm_mapping

Usage

confirm_mapping(.data, col1, col2, view = T)

Arguments

.data

a data frame

col1

column 1

col2

column 2

view

View results?

Value

A view of mappings. Also returns the view as a data frame invisibly.

Examples

iris %>% confirm_mapping(Species, Sepal.Width, view = FALSE)

confirm_overlap

Confirm Overlap

CRAN · 0.1.1 · validata/man/confirm_overlap.Rd · 2026-05-07

Prints a venn-diagram style summary of the unique value overlap between two columns and also invisibly returns a dataframe that can be assigned to a variable and queried with the overlap helpers. The helpers can return values that appeared only the first col, second col, or both cols.

Aliases

confirm_overlapco_find_only_in_1co_find_only_in_2co_find_in_both

Usage

confirm_overlap(vec1, vec2, return_tibble = F) co_find_only_in_1(co_output) co_find_only_in_2(co_output) co_find_in_both(co_output)

Arguments

vec1

vector 1

vec2

vector 2

return_tibble

logical. If TRUE, returns a tibble. otherwise by default returns the database invisibly to be queried by helper functions.

co_output

dataframe output from confirm_overlap

Value

tibble. overlap summary or overlap table

Examples

confirm_overlap(iris$Sepal.Width, iris$Sepal.Length) -> iris_overlap iris_overlap iris_overlap %>% co_find_only_in_1() iris_overlap %>% co_find_only_in_2() iris_overlap %>% co_find_in_both()

confirm_overlap_internal

Confirm Overlap internal

CRAN · 0.1.1 · validata/man/confirm_overlap_internal.Rd · 2026-05-07

A venn style summary of the overlap in unique values of 2 vectors

Aliases

confirm_overlap_internal

Keywords

internal

Usage

confirm_overlap_internal(vec1, vec2)

Arguments

vec1

vector 1

vec2

vector 2

Value

1 row tibble

Examples

confirm_overlap(iris$Sepal.Width, iris$Sepal.Length)

confirm_strlen

confirm string length

CRAN · 0.1.1 · validata/man/confirm_strlen.Rd · 2026-05-07

returns a count table of string lengths for a character column. The helper function choose_strlen filters dataframe for rows containing specific string length for the specified column.

Aliases

confirm_strlenchoose_strlen

Usage

confirm_strlen(mdb, col) choose_strlen(cs_output, len)

Arguments

mdb

dataframe

col

unquoted column

cs_output

dataframe. output from confirm_strlen

len

integer vector.

Value

prints a summary and returns a dataframe invisibly dataframe with original columns, filtered to the specific string length

Examples

iris %>% tibble::as_tibble() %>% confirm_strlen(Species) -> iris_cs_output iris_cs_output iris_cs_output %>% choose_strlen(6)

data_mode

CRAN · 0.1.1 · validata/man/data_mode.Rd · 2026-05-07

data_mode

Aliases

data_mode

Keywords

internal

Usage

data_mode(x, prop = TRUE)

Arguments

vector

prop

show frequency as ratio? default T

Value

named double of length 1

determine_distinct

Automatically determine primary key

CRAN · 0.1.1 · validata/man/determine_distinct.Rd · 2026-05-07

Uses confirm_distinct in an iterative fashion to determine the primary keys.

Aliases

determine_distinct

Usage

determine_distinct(df, ..., listviewer = TRUE)

Arguments

a data frame

...

columns or a tidyselect specification. defaults to everything

listviewer

logical. defaults to TRUE to view output using the listviewer package

Details

The goal of this function is to automatically determine which columns uniquely identify the rows of a dataframe. The output is a printed description of the combination of columns that form unique identifiers at each level. At level 1, the function tests if individual columns are primary keys At level 2, the function tests n C 2 combinations of columns to see if they form primary keys. The final level is testing all columns at once. For completely unique columns, they are recorded in level 1, but then dropped from the data frame to facilitate the determination of multi-column primary keys. If the dataset contains duplicated rows, they are eliminated before proceeding.

Value

list

Examples

sample_data1 %>% head ## on level 1, each column is tested as a unique identifier. the VAL columns have no ## duplicates and hence qualify, even though they normally would be considered as IDs ## on level 3, combinations of 3 columns are tested. implying that ID_COL 1,2,3 form a unique key ## level 2 does not appear, implying that combinations of any 2 ID_COLs do not form a unique key sample_data1 %>% determine_distinct(listviewer = FALSE)

determine_mapping

Determine pairwise structural mappings

CRAN · 0.1.1 · validata/man/determine_mapping.Rd · 2026-05-07

Determine pairwise structural mappings

Aliases

determine_mapping

Usage

determine_mapping(df, ..., listviewer = TRUE)

Arguments

a data frame

...

columns or a tidyselect specification

listviewer

logical. defaults to TRUE to view output using the listviewer package

Value

description of mappings

Examples

iris %>% determine_mapping(listviewer = FALSE)

determine_overlap

Determine Overlap

CRAN · 0.1.1 · validata/man/determine_overlap.Rd · 2026-05-07

Uses confirm_overlap in a pairise fashion to see venn style comparison of unique values between the columns chosen by a tidyselect specification.

Aliases

determine_overlap

Usage

determine_overlap(db, ...)

Arguments

a data frame

...

tidyselect specification. Default being everything.

Value

tibble

Examples

iris %>% determine_overlap()

diagnose

CRAN · 0.1.1 · validata/man/diagnose.Rd · 2026-05-07

Pipe in a dataframe to return a diagnosis of its missing and unique values for each columns. Default behavior is to diagnose all columns, but a subset can be specified in the dots with tidyselect.

Aliases

diagnose

Usage

diagnose(df, ...)

Arguments

dataframe

...

tidyselect

Details

this function is inspired by the excellent https://choonghyunryu.github.io/dlookr/dlookr package. It takes a dataframe and returns a summary of unique and missing values of the columns.

Value

dataframe summary

Examples

diagnose(iris)

diagnose_category

diagnose category

CRAN · 0.1.1 · validata/man/diagnose_category.Rd · 2026-05-07

counts the distinct entries of categorical variables. The max_distinct argument limits the scope to categorical variables with a maximum number of unique entries, to prevent overflow.

Aliases

diagnose_category

Usage

diagnose_category(.data, ..., max_distinct = 5)

Arguments

.data

dataframe

...

tidyselect

max_distinct

integer

Value

dataframe

Examples

diagnose_category(iris)

diagnose_missing

CRAN · 0.1.1 · validata/man/diagnose_missing.Rd · 2026-05-07

faster than diagnose if emphasis is on diagnosing missing values. Also, only shows the columns with any missing values.

Aliases

diagnose_missing

Usage

diagnose_missing(df, ...)

Arguments

dataframe

...

optional tidyselect

Value

tibble summary

Examples

diagnose_missing(tibble::tibble(x = c(NA, 1)))

diagnose_numeric

CRAN · 0.1.1 · validata/man/diagnose_numeric.Rd · 2026-05-07

Inputs a dataframe and returns various summary statistics of the numeric columns. For example zeros returns the ratio of 0 values in that column. minus counts negative values and infs counts Inf values. Other rarer metrics are also returned that may be helpful for quick diagnosis or understanding of numeric data. mode returns the most common value in the column (chooses at random in case of tie) , and mode_ratio returns its frequency as a ratio of the total rows

Aliases

diagnose_numeric

Usage

diagnose_numeric(.data, ...)

Arguments

.data

dataframe

...

tidyselect. Default: all numeric columns

Value

dataframe

Examples

iris %>% diagnose_numeric() %>% print(width = Inf)

mode_fn

statistical mode

CRAN · 0.1.1 · validata/man/mode_fn.Rd · 2026-05-07

returns the mode of a vector

Aliases

mode_fn

Usage

mode_fn(x)

Arguments

a vector

Value

a unit vector

Examples

c("b", "b", letters) %>% mode_fn()

mode_pct

mode with %

CRAN · 0.1.1 · validata/man/mode_pct.Rd · 2026-05-07

returns the mode of a vector with what percent of the data is the mode

Aliases

mode_pct

Usage

mode_pct(x)

Arguments

a vector

Value

a unit character vector

Examples

c("b", "b", letters) %>% mode_pct()

n_dupes

CRAN · 0.1.1 · validata/man/n_dupes.Rd · 2026-05-07

n_dupes

Aliases

n_dupes

Usage

n_dupes(x)

Arguments

a df

Value

an integer; number of dupe rows

sample_data1

Sample Data

CRAN · 0.1.1 · data · validata/man/sample_data1.Rd · 2026-05-07

Sample Data

Aliases

sample_data1

Keywords

125datainternalrowssamplewith

Usage

sample_data1

Format

6 columns. 3 id and 3 values ID_COL14-5 distinct codes

top_n_vals

top n vals

CRAN · 0.1.1 · validata/man/top_n_vals.Rd · 2026-05-07

top n vals

Aliases

top_n_vals

Usage

top_n_vals(x, top_n = 3)

Arguments

vector

top_n

integer to specify top n modes

Value

character unit vector

Examples

tibble::tibble(x = 1:10 %>% c(10,10,10,5,5)) -> t1 t1 %>% top_n_vals()

validata-package

validata: Validate Data Frames

CRAN · 0.1.1 · package · validata/man/validata-package.Rd · 2026-05-07

Functions for validating the structure and properties of data frames. Answers essential questions about a data set after initial import or modification. What are the unique or missing values? What columns form a primary key? What are the properties of the numeric or categorical columns? What kind of overlap or mapping exists between 2 columns?

Aliases

validatavalidata-package

Keywords

internal

버전 이력

Repository	Version	Published	First seen	Last seen	Docs
CRAN	0.1.0	2021-10-05	2026-05-31	2026-07-19
CRAN	0.1.1		2026-05-31	2026-07-10

보안

표시할 OSV 데이터가 없습니다.

문헌 신호

표시할 OpenAlex 데이터가 없습니다.