Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
During a recent Dask tutorial someone asked "Can Datashader run on Dask?" and I was pleased.
It would be good to have an easy-to-run example that was advertised Datashader from Dask resources. Often we do this in examples.dask.org . Is there an example that makes sense to deploy there?
Hackathon project: Geographical map using textual data, topics reported by school inspection reports to identify models for improving the quality of education
Malastare.ai is a startup Analytics Consulting Firm. Based in Texas, USA. We combine deep industry knowledge with specialized expertise in analytics, strategy, operations, and risk management. We leverage our clients' real-world experience, industry best practices and technology best practices to enable them to succeed in their big data projects.
Are we alone in the universe? - Data Analysis and Data Visualization of UFO sightings with R. How to analyze and visualize data of UFO sightings of the last century in the USA and the rest of the world with R language.
Over the past months, we have seen a significant racial justice reckoning happening across the country since the killing of George Floyd by a police officer in May 2020. This incident sparked a redirection of attention to similar lives that had been lost at the hands of officers, leading to calls for re-evaluation of the role and power that police hold. In order for stakeholders like activism groups and local policymakers to make the most change in the quickest and most effective manner in response to these calls, the data code and report strived to answer a questions that will enable this. The primary tool used was R, with ggplot and machine learning packages.
This is a Repository made for Coursera Assignments, and Tutorials which includes many interesting plots such as waffle charts, folium charts, chloropeth charts etc.
An analysis of my Facebook social network using a Gephi network graph visualization to determine the number and types of communities I am a part of online.
After reading many article that used data from Apple's health app. I wanted to try to create some data visualizations of test set count dataset. I used python, matplotlib and seaborn to analyze the step counts from the test data. I have included the work in a jupyter notebook file and the csv file that has the test data used for this side project.
I analyze and explore US Census Bureau Data using Data Visualization techniques to identify salient features useful for predicting an individual's income level. We use those relevant features and multiple classification methods (Decision-Tree, SVM, and K-Nearest Neighbor) to predict the income level for unknown individuals. Our client is a local University who wants to use income as the key demographic to decide criteria for marketing its degree programs. Each classifier explored has an accuracy of over 85%.
During a recent Dask tutorial someone asked "Can Datashader run on Dask?" and I was pleased.
It would be good to have an easy-to-run example that was advertised Datashader from Dask resources. Often we do this in examples.dask.org . Is there an example that makes sense to deploy there?