Find Most Representative Setlists using K-means Clustering — find_representative_setlists

This function uses k-means clustering to select a specified number of shows whose setlists, taken together, represent the variety of setlists in the dataset.

Usage

find_representative_setlists_kmeans(show_sequences_all, n_representatives = 30)

Arguments

show_sequences_all

A data frame with at least two columns:

sequence: A character vector of setlist sequences.
One or more additional columns containing show information.

n_representatives

An integer specifying the number of representative setlists to find. Default is 30.
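
As an illustration of the expected input shape, here is a minimal toy data frame. Only the sequence column is named in this documentation; the show_id and date columns, the song names, and the " > " separator between songs are assumptions made for the example and may not match the real dataset.

```r
# Hypothetical toy input. Only `sequence` is documented as required;
# `show_id`, `date`, and the " > " separator are illustrative assumptions.
show_sequences_all <- data.frame(
  show_id  = c("s1", "s2", "s3"),
  date     = as.Date(c("2019-06-01", "2019-06-02", "2019-06-03")),
  sequence = c("Song A > Song B > Song C",
               "Song A > Song C > Song D",
               "Song E > Song F"),
  stringsAsFactors = FALSE
)

# Inspect the column types; `sequence` should be character.
str(show_sequences_all)
```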

Value

A data frame containing the most representative setlists, with the same structure as the input data frame.

Examples

if (FALSE) { # \dontrun{
# Assuming show_sequences_all is your dataset
representative_shows <- find_representative_setlists_kmeans(show_sequences_all, n_representatives = 30)
print(representative_shows)
} # }
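
The general idea of picking representatives with k-means can be sketched as follows. This is not the package's implementation: sketch_representatives() is a hypothetical helper, the " > " separator and the binary show-by-song encoding are assumptions, and the real function may encode sequences quite differently. The sketch clusters the shows with stats::kmeans() and keeps, for each cluster, the show closest to the cluster centroid.

```r
# Hypothetical helper, NOT the package's implementation.
# Assumes songs in `sequence` are separated by `sep`.
sketch_representatives <- function(shows, k, sep = " > ") {
  # Split each sequence into its songs and collect the song vocabulary.
  songs <- strsplit(shows$sequence, sep, fixed = TRUE)
  vocab <- sort(unique(unlist(songs)))

  # Binary show-by-song matrix: 1 if the song appears in the show.
  x <- do.call(rbind, lapply(songs, function(s) as.numeric(vocab %in% s)))

  # Cluster the shows; nstart reruns k-means from several random starts.
  fit <- stats::kmeans(x, centers = k, nstart = 10)

  # For each cluster, keep the member show closest to its centroid.
  picks <- vapply(seq_len(k), function(j) {
    members <- which(fit$cluster == j)
    diffs <- sweep(x[members, , drop = FALSE], 2, fit$centers[j, ])
    members[which.min(rowSums(diffs^2))]
  }, integer(1))

  # Return the selected rows with the same columns as the input.
  shows[sort(picks), , drop = FALSE]
}

# Toy data with two obvious groups of setlists (made-up song names).
toy <- data.frame(
  show_id  = paste0("s", 1:6),
  sequence = c("A > B > C", "A > B > D", "A > B > C > D",
               "X > Y > Z", "X > Y", "X > Y > Z > W"),
  stringsAsFactors = FALSE
)

set.seed(1)  # k-means starts are random; fix the seed for repeatability
reps <- sketch_representatives(toy, k = 2)
print(reps)
```

Returning actual rows of the input, rather than the centroids themselves, keeps the result in the same structure as the input data frame, since a centroid is an average and usually does not correspond to any real show.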