Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
R-package: Methods for dividing data into groups. Create balanced partitions and cross-validation folds. Perform time series windowing and general grouping and splitting of data. Balance existing groups with up- and downsampling or collapse them to fewer groups.
Currently, a seed value is required for foldl which in turn causes aggregate to require a seed value. The functions should be able to be used both with and without a seed value. Additionally, the seed value appears to be misnamed as memo. This should be changed to something more suitable.
Unit tests should be added for these as well. #2 is close to being merged and presumably will have be
In this program it is possible to create a Burrito Matrix and extract the matrix, second column as sequence, sum or absolute difference of pairs of second column.
Currently, a seed value is required for
foldlwhich in turn causesaggregateto require a seed value. The functions should be able to be used both with and without a seed value. Additionally, the seed value appears to be misnamed asmemo. This should be changed to something more suitable.Unit tests should be added for these as well. #2 is close to being merged and presumably will have be