kmer_fingerprint {proteinKit} R Documentation

Calculate the morgan fingerprint based on the k-mer graph data

Description


Generates fixed-length molecular fingerprint vectors from k-mer graphs using
Morgan algorithm with circular topology hashing.

Usage

kmer_fingerprint(graph,
    radius = 3,
    len = 4096);

Arguments

graph

The k-mer graph object to fingerprint. [as KMerGraph]

radius

Neighborhood radius for structural feature capture. Larger values consider more distant node relationships. Default is 3. [as integer]

len

Output vector length (uses modulo hashing). Default 4096. [as integer]

Details

Authors

seqtoolkit

Value

Integer array fingerprint where indices represent structural features

clr value class

Examples

 let seqs = read.fasta("./proteins.fa");
 # create protein sequence graph based on k-mer(k=3)
 let g = kmer_graph(seqs, k = 3);
 
 for(let kmer in g) {
     print(kmer |> kmer_fingerprint());
 }

[Package proteinKit version 1.0.0.0 Index]