
Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.
History is littered with hundreds of conflicts over the future of a community, group, location or business that were "resolved" when one of the parties stepped ahead and destroyed what was there. With the original point of contention destroyed, the debates would fall to the wayside. Archive Team believes that by duplicated condemned data, the conversation and debate can continue, as well as the richness and insight gained by keeping the materials. Our projects have ranged in size from a single volunteer downloading the data to a small-but-critical site, to over 100 volunteers stepping forward to acquire terabytes of user-created data to save for future generations.
The main site for Archive Team is at archiveteam.org and contains up to the date information on various projects, manifestos, plans and walkthroughs.
This collection contains the output of many Archive Team projects, both ongoing and completed. Thanks to the generous providing of disk space by the Internet Archive, multi-terabyte datasets can be made available, as well as in use by the Wayback Machine, providing a path back to lost websites and work.
Our collection has grown to the point of having sub-collections for the type of data we acquire. If you are seeking to browse the contents of these collections, the Wayback Machine is the best first stop. Otherwise, you are free to dig into the stacks to see what you may find.
The Archive Team Panic Downloads are full pulldowns of currently extant websites, meant to serve as emergency backups for needed sites that are in danger of closing, or which will be missed dearly if suddenly lost due to hard drive crashes or server failures.
Summary
Today in the R package, there are a lot of internal function calls which use only positional arguments. Change them to use keyword arguments for extra safety.
I've added this issue to provide a small, focused contribution opportunity for Hacktoberfest 2020 participants. If you are an experienced open source contributor, please leave this for beginners and consider contributing on a different backlog item.
Motivation
Using positional arguments in internal code is dangerous, because it can allow mistakes to slip through or can lead to confusing errors.
This is especially dangerous for cases where the values you'd pass to two parameters take on similar values. For example, in
lightgbm(),eval_freqandearly_stopping_roundsare both positive integers and can take on similar values. You might not known, unless you're very careful, that you accidentally specified them in the wrong order.How to Contribute
To help with this issue, propose a pull request which replaces uses of positional arguments in the R package with keyword arguments.
That means changing calls like this:
R-package/R/{roxygen2}7.1.1 installed. If you have any issues with this, just ask when you submit your PR and I can regenerate the docs for youlgb.call()print(),length(),is.character(), etc.)Refer to #3391 as an example. If you have any questions, tag me on this issue and I can help.
Assignments
If you are interested, please comment on this issue and indicate which file inR-package/R/you'd like to help on. PLEASE ONLY TAKE ONE FILE, so that multiple people can use this as a learning experience.This issue is not urgent, so you can claim a file now and not contribute until Hacktoberfest begins on October 1st.The list below tracks the assignments and completions so far.
aliases.Rcallback.R- @philip-khor (#3430)lgb.Predictor.R- @AnshuTrivedi (#3464)lgb.train.R- @iadi7ya (#3452)lightgbm.Rremoved.RThanks for contributing to
LightGBM!