Package 'dfexpand'

Title: Automatically Expand Delimited Column Values into Multiple Binary Columns with 'dfexpand'
Description: Implements an algorithm to effortlessly split a column in an R data frame filled with multiple values separated by delimiters. This automates the process of creating separate columns for each unique value, transforming them into binary outcomes.
Authors: Jeffery Painter [aut, cre]
Maintainer: Jeffery Painter <[email protected]>
License: GPL (>= 3)
Version: 0.0.2
Built: 2024-11-20 05:35:49 UTC
Source: https://github.com/jlpainter/dfexpand

Help Index


Expand a single column containing delimited values into multiple binary columns

Description

Expand a single column containing delimited values into multiple binary columns

Usage

expand_column(
  dataframe,
  colname = NULL,
  delimiter = ";",
  trim = TRUE,
  ignore_case = FALSE,
  colnumber = NULL
)

Arguments

dataframe

The data frame containing the column we want to expand

colname

The name of the column to split on.

delimiter

A single character to split the string on.

ignore_case

Boolean flag if you want the split values to ignore case

colnumber

You can provide the column number in the dataframe to expand, rather than the name

trim

Boolean field to trim white space when searching for unique values

Value

A list of distinct values found in the entry string

Examples

library('dfexpand')
    myDelimiter = ";"

    # Create some fake data with duplicates
    rows = c(
        c("a;b"), c("a;b;c"), c("b;c"), c("d"), c("d")
    )

    # Add to a dataframe
    df = data.frame(rows)

    colnames(df) <- c("myvar")
    #
    # The default behavior is to trim extra whitespace from the extracted values, 
    # but not to alter or change the case of the values. So 'Alpha' is distinct from 'alpha'
    # but ' beta ' is the same as 'beta'. You can override this behavior with
    # the trim and ignore case flags.
    #
    expanded_df = expand_column(df, "myvar", myDelimiter)

dfexpand

Description

Methods to auto-expand a delimited string into a list of unique values

Usage

getDistinctValues(entry, delimiter, trim = TRUE, ignore_case = FALSE)

Arguments

entry

A string to parse.

delimiter

A single character to split the string on.

trim

Boolean flag to signify if the leading and trailing whitespace should be trimmed for each value found.

ignore_case

Boolean flag to indicate if the unique values extracted should ignore case differences or not.

Value

list

A list of distinct values found in the entry string

Examples

values <- getDistinctValues("a;b;c", ';')