The Wayback Machine - https://web.archive.org/web/20200326191122/https://github.com/topics/data-science
Skip to content
#

data-science

Here are 11,080 public repositories matching this topic...

jnboehm
jnboehm commented Mar 16, 2020

Describe the bug

Calling a pipeline with a nonparametric function causes an error since the function transform() is missing. The pipeline itself calls the function fit_transform() if it's present. For nonparametric functions (the most prominent being t-SNE) a regular transform() method does not exist since there is no projection or mapping that is learned. It could still be used f

bjornsturmberg
bjornsturmberg commented Jul 18, 2018

I became quite confused by the Algorithms to perform MCMC section.

The issue is that steps 3-4:
"3. Accept/Reject the new position based on the position's adherence to the data and prior distributions (ask if the pebble likely came from the mountain).
4.A If you accept: Move to the new position. Return to Step 1.
4.B Else: Do not move to new position. Return to Step 1.
"
imply that

Tiljander
Tiljander commented Mar 22, 2020

Hi,

First of all, thank you so much for a great library!

When there are overlapping matches in a Doc, EntityRuler prioritize longer patterns over shorter, and if equal in length the match occuring first in the Doc is chosen.

This is great, but it would be good if this behavior is explicitly stated in the documentation for clarity. Could be stated under init or call.

Which

cxrodgers
cxrodgers commented Jan 30, 2020

Note 1: This is not the same tab-completion bug for which many issues have already been opened (the one where a massive number of objects from the global namespace are displayed). This issue specifically has to do with unwanted abbreviation of filename paths. I haven't yet found any other reports of this issue.

Note 2: I also posted this to stackoverflow but am posting again here since I didn't

dash
gensim
michaeljneely
michaeljneely commented Jan 26, 2020

Describe the bug

Calling Predictor.get_gradients() returns an empty dictionary

To Reproduce
I am replicating the binary sentiment classification tasked described in the paper 'Attention is not Explanation ' (Jain and Wallace 2019 - https://arxiv.org/pdf/1902.10186.pdf).

My first experiment is on the Stanford Sentiment TreeBank Dataset. I need to measure the correlation between th

VojtechDostal
VojtechDostal commented Feb 19, 2020

Describe the bug

When doing reconciliation, I sometimes manually match entries to Wikidata items using "Search for match". I choose the correct match but the match is applied to all entries with the same name because the radio button "Match other cells with same content" is always ticked by default. Even if I change it, next time this button will again be ticked.

Expected behavior

miguelgfierro
miguelgfierro commented Mar 29, 2019

Description

@wutaomsft suggestion:

it would be a good discussion point what is preferred way to make references in notebooks. I prefer not to have a"reference" section where references are separate from where they are referred. Instead, link the reference "in place". And then add a paragraph for "additional reading", which is more descr

Improve this page

Add a description, image, and links to the data-science topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-science topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.