A Java application that loads three CSV files obtained from ntopng, CICFlowMeter, and nDPI. It compares the flow statistics that CICFlowMeter and ntopng extract from the same pcap files, and once a match is found, that flow is labeled with the Layer 7 protocol reported by the nDPI library within ntopng. The application outputs a CSV file in which every flow is labeled with its Layer 7 protocol.
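
The matching step could be sketched roughly as follows. This is an illustration only, not the application's actual code: the description does not say which fields are compared, so the sketch assumes flows are matched on the 5-tuple (source IP, source port, destination IP, destination port, transport protocol). The class name `FlowLabeler`, the column order, and the `Unknown` label for unmatched flows are all hypothetical, and CSV parsing is left out so that rows arrive as already-split string arrays.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class FlowLabeler {

    /** Builds a lookup key from the 5-tuple (assumed matching criterion). */
    static String flowKey(String srcIp, String srcPort, String dstIp,
                          String dstPort, String proto) {
        return String.join("|", srcIp, srcPort, dstIp, dstPort, proto);
    }

    /**
     * Indexes ntopng/nDPI rows by 5-tuple.
     * Assumed row layout: {srcIp, srcPort, dstIp, dstPort, proto, l7Protocol}.
     */
    static Map<String, String> indexL7(List<String[]> ntopngRows) {
        Map<String, String> l7ByKey = new HashMap<>();
        for (String[] r : ntopngRows) {
            l7ByKey.put(flowKey(r[0], r[1], r[2], r[3], r[4]), r[5]);
        }
        return l7ByKey;
    }

    /**
     * Appends a Layer 7 label to each CICFlowMeter row.
     * Assumed row layout: {srcIp, srcPort, dstIp, dstPort, proto, ...statistics}.
     * Rows without a match get "Unknown" (an assumption of this sketch).
     */
    static List<String> label(List<String[]> cicRows, Map<String, String> l7ByKey) {
        List<String> labeled = new ArrayList<>();
        for (String[] r : cicRows) {
            String key = flowKey(r[0], r[1], r[2], r[3], r[4]);
            String l7 = l7ByKey.getOrDefault(key, "Unknown");
            labeled.add(String.join(",", r) + "," + l7);
        }
        return labeled;
    }

    public static void main(String[] args) {
        // Tiny in-memory stand-ins for the parsed CSV files.
        List<String[]> ntopngRows = List.of(
            new String[] {"10.0.0.1", "1234", "10.0.0.2", "443", "6", "TLS"});
        List<String[]> cicRows = List.of(
            new String[] {"10.0.0.1", "1234", "10.0.0.2", "443", "6", "0.5"},
            new String[] {"10.0.0.3", "5555", "10.0.0.4", "53", "17", "0.1"});

        for (String line : label(cicRows, indexL7(ntopngRows))) {
            System.out.println(line);
        }
    }
}
```

Indexing one side in a hash map keeps the comparison linear in the number of flows. A real implementation would likely also have to handle bidirectional flows and timestamps, which this sketch ignores.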