-
Médialab Sciences-Po
- Paris
- http://medialab.sciences-po.fr
Block or Report
Block or report boogheta
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
-
-
-
medialab/hyphe Public
Websites crawler with built-in exploration and control web interface
-
-
see also section scraping on custom levels of depth
-
530 contributions in the last year
Less
More
Contribution activity
April 2022
Created 25 commits in 2 repositories
Opened 11 issues in 2 repositories
medialab/hyphe
10
open
- Pouvoir activer/désactiver les tags multivalue
- Pouvoir renommer un corpus
- Allow to set and filter tags from listwebentities
- Add a tool to cleanup/merge entities
- Renew code to load recent user agents
- Give access from hyphe frontend to scrapyd's crawl logs
- IMPORT add the option to load existing hyphe data/metas
- Add to CRAWL a page listing all IN uncrawled WEs and propose to crawl them
- Ajouter un lien vers les entités depuis la liste All crawls
- Problems with encoded characters in prefixes

