Package org.carrot2.text.preprocessing
Class SparseArray
java.lang.Object
org.carrot2.text.preprocessing.SparseArray
Sparse array encoding utilities. Sparse means an index and its value are kept in an array as a
pair.
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionstatic int
countUnique
(int[] buffer, int fromIndex, int toIndex) Count unique values in the sorted array.static int[]
mergeSparseArrays
(Iterable<int[]> source) Merge data from one or more sparse arrays.static String
sparseToString
(int[] intIntArray) Convert an int-int compact mapping array to a string.static int[]
toSparseEncoding
(com.carrotsearch.hppc.IntStack documents) Convert a list of documents to sparse document-count representation.static int[]
toSparseEncodingByHash
(com.carrotsearch.hppc.IntStack documents) Convert to sparse encoding using a hash map.static int[]
toSparseEncodingBySort
(com.carrotsearch.hppc.IntStack documents) Convert to sparse encoding using sorting and counting.
-
Constructor Details
-
SparseArray
public SparseArray()
-
-
Method Details
-
toSparseEncoding
public static int[] toSparseEncoding(com.carrotsearch.hppc.IntStack documents) Convert a list of documents to sparse document-count representation. -
toSparseEncodingByHash
public static int[] toSparseEncodingByHash(com.carrotsearch.hppc.IntStack documents) Convert to sparse encoding using a hash map. -
toSparseEncodingBySort
public static int[] toSparseEncodingBySort(com.carrotsearch.hppc.IntStack documents) Convert to sparse encoding using sorting and counting. -
countUnique
public static int countUnique(int[] buffer, int fromIndex, int toIndex) Count unique values in the sorted array. -
mergeSparseArrays
Merge data from one or more sparse arrays. -
sparseToString
Convert an int-int compact mapping array to a string.
-