readMLData

R 패키지 메타데이터와 수집 신호를 모아 봅니다.

Packages / CRAN / readMLData

readMLData

v0.9-7
Repository CRANLicense GPL-3Lifecycle activeNeeds compilation no
DOI
10.32614/CRAN.package.readMLData

Core Signals

첫 화면에서 판단해야 할 수집 신호를 먼저 배치합니다.

0
표시할 핵심 신호가 없습니다.

Supported Backends

DESCRIPTION에서 감지한 backend 관련 package입니다.

0
backend package 신호가 없습니다.

Quick Facts

기본 메타데이터를 작은 카드와 토큰으로 압축합니다.

profile
Repository
CRAN
Version
0.9-7
License
GPL-3
Lifecycle
active
Needs compilation
no
Last observed
2026-05-30
CRAN
cran.r-project.org/package=readMLData

수집 소스별 패키지 정보

1개 소스
CRAN
0.9-7
2026-05-30
License
GPL-3
Imports
XML
Needs compilation
no
Lifecycle
active
Last observed
2026-05-30 10:45:11

이 패키지가 의존하는 패키지

1개 표시전체 1개
PackageTypeSpec
XML
CRAN · 0.9-7 · 2026-05-30
ImportsXML
1 / 1

이 패키지를 쓰는 패키지

0개 표시전체 0개
PackageTypeSpec
표시할 dependency edge가 없습니다.
1 / 1

패키지 페이지

All links
18
Repository
CRAN
Version
0.9-7
Collected
2026-05-16 18:58:52
Package page
https://cran.r-project.org/web/packages/readMLData/index.html
DOI
10.32614/CRAN.package.readMLData
CRAN checks
https://cran.r-project.org/web/checks/check_results_readMLData.html
Reference HTML
https://cran.r-project.org/web/packages/readMLData/refman/readMLData.html
Reference PDF
https://cran.r-project.org/web/packages/readMLData/readMLData.pdf
Source package
https://cran.r-project.org/src/contrib/readMLData_0.9-7.tar.gz
Archive
https://CRAN.R-project.org/src/contrib/Archive/readMLData
Page fields
Author
Petr Savicky
CRAN Checks
readMLData results
DOI
10.32614/CRAN.package.readMLData
License
GPL-3
Maintainer
Petr Savicky <savicky at cs.cas.cz>
Materials
ChangeLog
NeedsCompilation
no
Old Sources
readMLData archive
Package Source
readMLData_0.9-7.tar.gz
Published
2015-01-13
Reference Manual
readMLData.html , readMLData.pdf
URL
http://www.cs.cas.cz/~savicky/readMLData
Version
0.9-7
Windows Binaries
r-devel: readMLData_0.9-7.zip , r-release: readMLData_0.9-7.zip , r-oldrel: readMLData_0.9-7.zip
MacOS Binaries
r-release (arm64): readMLData_0.9-7.tgz , r-oldrel (arm64): readMLData_0.9-7.tgz , r-release (x86_64): readMLData_0.9-7.tgz , r-oldrel (x86_64): readMLData_0.9-7.tgz
Version
0.9-7
Published
2015-01-13
DOI
10.32614/CRAN.package.readMLData
Author
Petr Savicky
Maintainer
Petr Savicky <savicky at cs.cas.cz>
License
GPL-3
URL
http://www.cs.cas.cz/~savicky/readMLData
NeedsCompilation
no
Materials
ChangeLog
CRAN Checks
readMLData results
Reference Manual
readMLData.html , readMLData.pdf
Package Source
readMLData_0.9-7.tar.gz
Windows Binaries
r-devel: readMLData_0.9-7.zip , r-release: readMLData_0.9-7.zip , r-oldrel: readMLData_0.9-7.zip
MacOS Binaries
r-release (arm64): readMLData_0.9-7.tgz , r-oldrel (arm64): readMLData_0.9-7.tgz , r-release (x86_64): readMLData_0.9-7.tgz , r-oldrel (x86_64): readMLData_0.9-7.tgz
Old Sources
readMLData archive
Page sections 3
Documentation
Heading
Documentation
Links
[{"label":"readMLData.html","section":"","type":"","url":"https://cran.r-project.org/web/packages/readMLData/refman/readMLData.html"},{"label":"readMLData.pdf","section":"","type":"","url":"https://cran.r-project.org/web/packages/readMLData/readMLData.pdf"}]
Text
Reference manual: readMLData.html , readMLData.pdf
Downloads
Heading
Downloads
Links
[{"label":"readMLData_0.9-7.tar.gz","section":"","type":"","url":"https://cran.r-project.org/src/contrib/readMLData_0.9-7.tar.gz"},{"label":"readMLData_0.9-7.zip","section":"","type":"","url":"https://cran.r-project.org/bin/windows/contrib/4.7/readMLData_0.9-7.zip"},{"label":"readMLData_0.9-7.zip","section":"","type":"","url":"https://cran.r-project.org/bin/windows/contrib/4.6/readMLData_0.9-7.zip"},{"label":"readMLData_0.9-7.zip","section":"","type":"","url":"https://cran.r-project.org/bin/windows/contrib/4.5/readMLData_0.9-7.zip"},{"label":"readMLData_0.9-7.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/sonoma-arm64/contrib/4.6/readMLData_0.9-7.tgz"},{"label":"readMLData_0.9-7.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/big-sur-arm64/contrib/4.5/readMLData_0.9-7.tgz"},{"label":"readMLData_0.9-7.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/big-sur-x86_64/contrib/4.6/readMLData_0.9-7.tgz"},{"label":"readMLData_0.9-7.tgz","section":"","type":"","url":"https://cran.r-project.org/bin/macosx/big-sur-x86_64/contrib/4.5/readMLData_0.9-7.tgz"},{"label":"readMLData archive","section":"","type":"","url":"https://CRAN.R-project.org/src/contrib/Archive/readMLData"}]
Text
Package source: readMLData_0.9-7.tar.gz Windows binaries: r-devel: readMLData_0.9-7.zip , r-release: readMLData_0.9-7.zip , r-oldrel: readMLData_0.9-7.zip macOS binaries: r-release (arm64): readMLData_0.9-7.tgz , r-oldrel (arm64): readMLData_0.9-7.tgz , r-release (x86_64): readMLData_0.9-7.tgz , r-oldrel (x86_64): readMLData_0.9-7.tgz Old sources: readMLData archive
Linking
Heading
Linking
Links
[{"label":"https://CRAN.R-project.org/package=readMLData","section":"","type":"","url":"https://CRAN.R-project.org/package=readMLData"}]
Text
Please use the canonical form https://CRAN.R-project.org/package=readMLData to link to this page.
Materials 1
Documentation 2
Downloads 9
All page links 18

패키지 문서 원문

2 artifacts
reference_manual_html
Reference manual HTML
CRAN · 0.9-7 · Documentation · text/html · 27,708 · 2026-05-07
Title
Help for package readMLData
Label
Reference manual HTML
Text content
Text content
Help for package readMLData const macros = { "\\R": "\\textsf{R}", "\\mbox": "\\text", "\\code": "\\texttt"}; function processMathHTML() { var l = document.getElementsByClassName('reqn'); for (let e of l) { katex.render(e.textContent, e, { throwOnError: false, macros }); } return; } Package {readMLData} Contents readMLData-package analyzeData checkConsistency checkType dsDownload dsRead dsSearch dsSort getAvailable getFields getPath getType prepareDSList xml Type: Package Title: Reading Machine Learning Benchmark Data Sets in Different Formats Version: 0.9-7 Date: 2015-01-13 Author: Petr Savicky Maintainer: Petr Savicky <savicky@cs.cas.cz> Description: Functions for reading data sets in different formats for testing machine learning tools are provided. This allows to run a loop over several data sets in their original form, for example if they are downloaded from UCI Machine Learning Repository. The data are not part of the package and have to be downloaded separately. Imports: XML License: GPL-3 URL: http://www.cs.cas.cz/~savicky/readMLData Packaged: 2015-01-13 10:52:09 UTC; savicky NeedsCompilation: no Repository: CRAN Date/Publication: 2015-01-13 12:10:48 Reading data from different sources in their original format. Description The package contains functions, which allow to maintain and use a structure describing a collection of machine learning datasets and read them into R environment using a unified interface, see function prepareDSList() and dsRead() . Details The data are not part of the package. The package requires to receive a path to a local copy of the data and their description. The description of the data sets consists of a directory, which contains an XML file contents.xml and subdirectory "scripts", which contains an R script for each data set, which reads the data set into R. File contents.xml contains information on all the data sets. In particular it contains their names for local identification, their public names, and the names of files representing the data set. The name of the script for reading a data set is derived from its identification name. The complete list of the fields in contents.xml may be obtained using getFields() . For the simplest use of the package for reading the data sets, the functions prepareDSList() and dsRead() are sufficient. The remaining functions are useful for including further data sets to the description. Use help(package=readMLData) or library(help=readMLData) to see the list of functions. The list of fields, which should be included in "contents.xml" , consists of the fields with either usage=="obligatory" or usage=="optional" in the table produced by getFields() . Fields with usage=="additional" and usage=="computed" are included automatically by the function prepareDSList() . An example of the description directory describing three UCI data sets is in exampleDescription subdirectory of the installed package. The data themselves are in exampleData subdirectory. See http://www.cs.cas.cz/~savicky/readMLData/ for description files of further data sets from UCI Machine Learning Repository. Author(s) Petr Savicky References UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/ . Additional resources for the CRAN package readMLData, http://www.cs.cas.cz/~savicky/readMLData/ . Determine the type of values in each column of a data frame. Description For each column, its class and the number of different values is determined. For numeric columns, also the minimum and maximum is computed. Usage analyzeData(dat) Arguments dat A data frame. Value A data frame with columns "class", "num.unique", "min", "max" , which correspond to properties of columns of dat . The rows in the output data frame correspond to the columns of dat . Author(s) Petr Savicky See Also readMLData . Examples pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) dat <- dsRead(dsList, "glass") analyzeData(dat) Checks consistency of the data frame dsList . Description Checks consistency of the parameters specified for each dataset in the dsList data frame created by prepareDSList() . Usage checkConsistency(dsList, outputInd=FALSE) Arguments dsList Data frame as created by prepareDSList() . outputInd Logical. Determines, whether the output should be a vector of indices of the data sets with conflicts. Value Depending on outputInd , either a vector of indices of data sets with a conflict between the specified parameters or NULL invisibly. Author(s) Petr Savicky See Also readMLData . Examples pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) checkConsistency(dsList) Compares the type of columns stored in dsList and in a data set itself. Description Compares types. Usage checkType(dsList, id, dat=NULL) Arguments dsList Data frame describing the data sets as produced by prepareDSList() . id Numeric or character of length one. Index or the identification of a data set. dat An optional data frame as read by dsRead(dsList, id, keepContents=TRUE) . Value The name of the tested data set and the result of the test is printed. If errors are found, a more detailed message is printed. The output value is TRUE or FALSE invisibly according, whether the types are correct or not. Author(s) Petr Savicky See Also readMLData . Examples pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) checkType(dsList, 1) Run an external tool to download a data set. Description The function allows to run an external download tool with arguments read from a file in a data folder. Usage dsDownload(dsList, id, command, fileName) Arguments dsList Data frame as created by prepareDSList() . id Name of the data set in dsList$identification or the index of the row in dsList corresponding to the data set. command Character. A command line web downloding tool, for example "wget" . fileName Character. A name of the file in the data directory, which contains the URL of the data on the web. Details If no data set or more than one data set corresponding to id is found, a corresponding error message is printed. Value Function has no value. The protocol generated by the specified tool is printed. Author(s) Petr Savicky See Also readMLData . Examples ## Not run: pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) dat <- dsDownload(dsList, "glass", "wget", "links.txt") ## End(Not run) Loading machine learning data from a directory tree using a unified interface. Description The function allows to read data sets included in the description in the data frame dsList into R environment using a unified interface. Usage dsRead(dsList, id, responseName = NULL, originalNames=TRUE, deleteUnused=TRUE, keepContents=FALSE) Arguments dsList Data frame as created by prepareDSList() . id Name of the data set in dsList$identification or the index of the row in dsList corresponding to the data set. responseName Character. The required name of the response column in the output data frame created from the data set. originalNames If TRUE, the original names of columns are used, if they are present in the description XML file. deleteUnused Logical. Controls, whether the columns containing case labels or other columns not suitable as attributes, are removed from the data. keepContents Logical. If TRUE , then deleteUnused parameter is ignored and no columns are converted to factors. Details The function uses dsList$avaiable to determine, whether the files for the required data set is present in the local directory dsList$pathData . If not, a corresponding error message is printed. See prepareDSList() and getAvailable() . Value A data frame containing the required data set, possibly transformed according to the setting of the parameters responseName, originalNames, deleteUnused
section
readMLData.pdf
CRAN · 0.9-7 · Documentation · application/pdf · 114,125 · 2026-05-07
Title
readMLData.pdf
Label
readMLData.pdf

Reference for readMLData (0.9-7)

14개 topic
analyzeData
Determine the type of values in each column of a data frame.
CRAN · 0.9-7 · readMLData/man/analyzaData.Rd · 2026-05-07

For each column, its class and the number of different values is determined. For numeric columns, also the minimum and maximum is computed.

Aliases
analyzeData
Keywords
data
Usage
analyzeData(dat)
Arguments
dat
A data frame.
Value
A data frame with columns "class", "num.unique", "min", "max", which correspond to properties of columns of dat. The rows in the output data frame correspond to the columns of dat.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) dat <- dsRead(dsList, "glass") analyzeData(dat)
See also
readMLData.
Author
Petr Savicky
checkConsistency
Checks consistency of the data frame dsList.
CRAN · 0.9-7 · readMLData/man/checkConsistency.Rd · 2026-05-07

Checks consistency of the parameters specified for each dataset in the dsList data frame created by prepareDSList().

Aliases
checkConsistency
Keywords
data
Usage
checkConsistency(dsList, outputInd=FALSE)
Arguments
dsList
Data frame as created by prepareDSList().
outputInd
Logical. Determines, whether the output should be a vector of indices of the data sets with conflicts.
Value
Depending on outputInd, either a vector of indices of data sets with a conflict between the specified parameters or NULL invisibly.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) checkConsistency(dsList)
See also
readMLData.
Author
Petr Savicky
checkType
Compares the type of columns stored in dsList and in a data set itself.
CRAN · 0.9-7 · readMLData/man/checkType.Rd · 2026-05-07

Compares types.

Aliases
checkType
Keywords
data
Usage
checkType(dsList, id, dat=NULL)
Arguments
dsList
Data frame describing the data sets as produced by prepareDSList().
id
Numeric or character of length one. Index or the identification of a data set.
dat
An optional data frame as read by dsRead(dsList, id, keepContents=TRUE).
Value
The name of the tested data set and the result of the test is printed. If errors are found, a more detailed message is printed. The output value is TRUE or FALSE invisibly according, whether the types are correct or not.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) checkType(dsList, 1)
See also
readMLData.
Author
Petr Savicky
dsDownload
Run an external tool to download a data set.
CRAN · 0.9-7 · readMLData/man/dsDownload.Rd · 2026-05-07

The function allows to run an external download tool with arguments read from a file in a data folder.

Aliases
dsDownload
Keywords
data
Usage
dsDownload(dsList, id, command, fileName)
Arguments
dsList
Data frame as created by prepareDSList().
id
Name of the data set in dsList$identification or the index of the row in dsList corresponding to the data set.
command
Character. A command line web downloding tool, for example "wget".
fileName
Character. A name of the file in the data directory, which contains the URL of the data on the web.
Details
If no data set or more than one data set corresponding to id is found, a corresponding error message is printed.
Value
Function has no value. The protocol generated by the specified tool is printed.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) dat <- dsDownload(dsList, "glass", "wget", "links.txt")
See also
readMLData.
Author
Petr Savicky
dsRead
Loading machine learning data from a directory tree using a unified interface.
CRAN · 0.9-7 · readMLData/man/dsRead.Rd · 2026-05-07

The function allows to read data sets included in the description in the data frame dsList into R environment using a unified interface.

Aliases
dsRead
Keywords
data
Usage
dsRead(dsList, id, responseName = NULL, originalNames=TRUE, deleteUnused=TRUE, keepContents=FALSE)
Arguments
dsList
Data frame as created by prepareDSList().
id
Name of the data set in dsList$identification or the index of the row in dsList corresponding to the data set.
responseName
Character. The required name of the response column in the output data frame created from the data set.
originalNames
If TRUE, the original names of columns are used, if they are present in the description XML file.
deleteUnused
Logical. Controls, whether the columns containing case labels or other columns not suitable as attributes, are removed from the data.
keepContents
Logical. If TRUE, then deleteUnused parameter is ignored and no columns are converted to factors.
Details
The function uses dsList$avaiable to determine, whether the files for the required data set is present in the local directory dsList$pathData. If not, a corresponding error message is printed. See prepareDSList() and getAvailable().
Value
A data frame containing the required data set, possibly transformed according to the setting of the parameters responseName, originalNames, deleteUnused. If an error occurred, the function outputs NULL.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) dat <- dsRead(dsList, "glass") dim(dat)
See also
readMLData, prepareDSList, getAvailable.
Author
Petr Savicky
dsSearch
Search a dataset by string matching against the names stored in dsList.
CRAN · 0.9-7 · readMLData/man/dsSearch.Rd · 2026-05-07

The function allows string matching against some of the fields "identification", "fullName", "dirName", "files" of the structure describing the data sets.

Aliases
dsSearch
Keywords
data
Usage
dsSearch(dsList, id, searchField=c("identification", "fullName", "dirName", "files"), searchType=c("exact", "prefix", "suffix", "anywhere"), caseSensitive=FALSE)
Arguments
dsList
Data frame as created by prepareDSList().
id
Character of length one or numeric of length at most nrow(dsList). If character, then it is used as a search string to be matched against the names of datasets. If numeric, it is used as indices of data sets in dsList.
searchField
Character. Name of a column in dsList to be searched.
searchType
Character. Type of search.
caseSensitive
Logical. Whether the search should be case sensitive.
Details
The parameter searchField determines, which column of dsList is searched, parameters searchType and caseSensitive influence the type of search. These three parameters are ignored, if id is numeric. Regular expressions are not used. Matching with searchType="exact" is done with ==, searchType="prefix" and searchType="suffix" are implemented using substr(), searchType="anywhere" is implemented using grep(, fixed=TRUE).
Value
Data frame containing the indices and identification of the matching data sets and the value of the search field, if applicable.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) dsSearch(dsList, "ident", searchField="fullName", searchType="anywhere")
See also
readMLData.
Author
Petr Savicky
dsSort
Sort the rows of a data frame.
CRAN · 0.9-7 · readMLData/man/dsSort.Rd · 2026-05-07

Sort the rows of a data frame lexicographically. This allows to compare two data sets as sets of cases disregarding their order.

Aliases
dsSort
Keywords
data
Usage
dsSort(dat)
Arguments
dat
a dataframe.
Details
The function calls order() with the columns of dat as the sorting criteria.
Value
Data frame, whose rows are reordered by the sorting.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) dat <- dsRead(dsList, "glass") sorted <- dsSort(dat)
See also
readMLData.
Author
Petr Savicky
getAvailable
Checks consistency of the data frame dsList.
CRAN · 0.9-7 · readMLData/man/getAvailable.Rd · 2026-05-07

Checks whether all the files of a specified data set are accesible in a local directory.

Aliases
getAvailable
Keywords
data
Usage
getAvailable(dsList, id=NULL, asLogical=FALSE)
Arguments
dsList
Data frame as created by prepareDSList().
id
Character or numeric vector. A character vector should contain names matching the names dsList$identification. Numeric vector should consist of the indices of the rows in dsList corresponding to the data set. If id=NULL, then all data sets are checked.
asLogical
Logical, whether the output should be a logical vector of the same length as id or a character vector containing the identification of the available data sets.
Details
The test is not completely reliable, since it only verifies that the files with the required file name are accessible. If the files require some transformations after download and these are not performed, the data set is still reported as available. The test uses file names specified in contents.xml file. If these names are by mistake different from the files actually read in the reading scripts, then the test may also yield an incorrect result.
Value
Logical vector of the length length(id) specifying for each component of id the result of the check or a character vector containing the identification of the available data sets.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) getAvailable(dsList)
See also
readMLData.
Author
Petr Savicky
getFields
Prints the information on the fields in the data frame dsList describing the data sets.
CRAN · 0.9-7 · readMLData/man/getFields.Rd · 2026-05-07

The data frame dsList contains names of the data sets, the names of the directories, the files, which belong to each of the data sets, and some other information. The function returns a table describing the fields and their usage.

Aliases
getFields
Keywords
data
Usage
getFields()
Value
Table containing the names, types and usage of the fields expected in dsList.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) getFields()
See also
readMLData.
Author
Petr Savicky
getPath
Determine the path to package example directories.
CRAN · 0.9-7 · readMLData/man/getPath.Rd · 2026-05-07

Appends the path to the directory of an installed package and a name of its subdirectory.

Aliases
getPath
Keywords
data
Usage
getPath(dirName)
Arguments
dirName
Character. Name of the example subdirectory of an installed package. This is currently exampleDescription or exampleData.
Value
Character string, which is a full path to the required example directory in an installed package.
See also
prepareDSList
Author
Petr Savicky
getType
Determines the type vector for an input data set.
CRAN · 0.9-7 · readMLData/man/getType.Rd · 2026-05-07

The type information is derived from the contents of individual columns of an input data frame.

Aliases
getType
Keywords
data
Usage
getType(dat)
Arguments
dat
A data frame.
Value
A character vector of length ncol(dat) containing "n" for numerical columns, the number of different values for character or factor columns, and "o" otherwise.
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription) dat <- dsRead(dsList, "annealing") getType(dat)
See also
readMLData.
Author
Petr Savicky
prepareDSList
Prepares a data frame dsList, which describes the data contained in a local data description directory.
CRAN · 0.9-7 · readMLData/man/prepareDSList.Rd · 2026-05-07

The data frame dsList is needed to read the data contained in a directory tree below dsList$pathData using dsRead(). The directory pathDescription is expected to contain the file contents.xml and subdirectory scripts with R scripts for reading the data sets.

Aliases
prepareDSList
Keywords
data
Usage
prepareDSList(pathData, pathDescription)
Arguments
pathData
Character. A path to the required data directory.
pathDescription
Character. A path to a directory containing description of the required data, in particular the file "contents.xml".
Details
The character "~" expands to your home directory. The directory pathData need not contain all the data sets included in pathDescription/contents.xml. The function getAvailable() is called and its output is stored in column availability of the output data frame, which is logical and specifies for each data set, whether it is or is not present. See http://www.cs.cas.cz/~savicky/readMLData/ for description files of some of the data sets from UCI Machine Learning Repository. See the help page readMLData for more information on the structure of the description files.
Value
Data frame with columns pathData, pathDescription, and other as listed by getFields(). The output data frame can be used as dsList parametr of functions dsSearch(), dsRead(), checkConsistency(), checkType().
Examples
pathData <- getPath("exampleData") pathDescription <- getPath("exampleDescription") dsList <- prepareDSList(pathData, pathDescription)
See also
readMLData, getAvailable, checkConsistency.
Author
Petr Savicky
readMLData-package
Reading data from different sources in their original format.
CRAN · 0.9-7 · package · readMLData/man/readMLData-package.Rd · 2026-05-07

The package contains functions, which allow to maintain and use a structure describing a collection of machine learning datasets and read them into R environment using a unified interface, see function prepareDSList() and dsRead().

Aliases
readMLData-packagereadMLData
Keywords
package
Details
The data are not part of the package. The package requires to receive a path to a local copy of the data and their description. The description of the data sets consists of a directory, which contains an XML file contents.xml and subdirectory "scripts", which contains an R script for each data set, which reads the data set into R. File contents.xml contains information on all the data sets. In particular it contains their names for local identification, their public names, and the names of files representing the data set. The name of the script for reading a data set is derived from its identification name. The complete list of the fields in contents.xml may be obtained using getFields(). For the simplest use of the package for reading the data sets, the functions prepareDSList() and dsRead() are sufficient. The remaining functions are useful for including further data sets to the description. Use help(package=readMLData) or library(help=readMLData) to see the list of functions. The list of fields, which should be included in "contents.xml", consists of the fields with either usage=="obligatory" or usage=="optional" in the table produced by getFields(). Fields with usage=="additional" and usage=="computed" are included automatically by the function prepareDSList(). An example of the description directory describing three UCI data sets is in exampleDescription subdirectory of the installed package. The data themselves are in exampleData subdirectory. See http://www.cs.cas.cz/~savicky/readMLData/ for description files of further data sets from UCI Machine Learning Repository.
Author
Petr Savicky
References
UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/. Additional resources for the CRAN package readMLData, http://www.cs.cas.cz/~savicky/readMLData/.
xml
Handling XML files.
CRAN · 0.9-7 · readMLData/man/xml.Rd · 2026-05-07

Input and output of a data set description from and to a XML file. These functions are not inteded for direct use by the user for reading the data sets. The function readDSListFromXML() is called from prepareDataDir(). The function saveDSListAsXML is used for preparing the file contents.xml in the data set description directory.

Aliases
readDSListFromXMLsaveDSListAsXML
Keywords
data
Usage
readDSListFromXML(filename) saveDSListAsXML(dsList, filename)
Arguments
dsList
A data frame created by prepareDataDirectory().
filename
The name of an XML file to be used.
Value
saveDSListAsXML() returns the filename of the created file. readDSListFromXML() returns a data frame with the description of the data sets.
See also
readMLData.
Author
Petr Savicky

버전 이력

RepositoryVersionPublishedFirst seenLast seenDocs
CRAN0.9-72026-05-292026-05-30

보안

표시할 OSV 데이터가 없습니다.

문헌 신호

표시할 OpenAlex 데이터가 없습니다.