The Wayback Machine - https://web.archive.org/web/20200304113101/https://github.com/topics/data-analysis
Skip to content
#

data-analysis

Here are 5,338 public repositories matching this topic...

Khris777
Khris777 commented Feb 18, 2020

Code Sample, a copy-pastable example if possible

import pandas as pd
import numpy as np

df = pd.DataFrame({'A': [1, 2, 3, 4, 5], 'B': [7, 5, np.nan, 3, 2]})
df.plot(x='A', y='B')
df = df.astype('Int64')
df.plot(x='A', y='B')

Problem description

The first plotting command works, the second throws the error message

TypeError: float() argument must be a string o

rgarrigue
rgarrigue commented Feb 10, 2020

Hi there

I'm trying to parse this king of lines, from a python flask service whose log format is %(asctime)s [%(process)d] (%(levelname)s) (%(name)s): %(message)s

2020-02-10 13:58:38,594 [31383] (INFO) (flask.app): request: OPTIONS https://server_hostname/0.1/token/a_big_uuid {'Host': 'server_hostname', 'X-Script-Name': '/api/auth', 'X-Forwarded-For': 'an_IP_address', 'Connection': 'c
Christilut
Christilut commented Nov 6, 2019

Is your feature request related to a problem? Please describe.
When working with a big piece of text, I sometimes scroll down and copy some text into another tab. When switching back to the first tab, both the input and the output pane is back on top. So I don't know where I was working just now.

Describe the solution you'd like
After tab switching, scroll position should be remembere

antoine2711
antoine2711 commented Feb 9, 2020

Describe the bug
I'm trying to export with « Custom tabular exporter… » and choosing « Upload to -> A new Google spreadsheet », and in some rows, if a value is missing, then the cell value disapear and the cell on the right shift left.

To Reproduce
Steps to reproduce the behavior:

  1. « Custom tabular exporter… »
  2. choose « Upload to -> A new Google spreadsheet »
  3. See the result
ChadFulton
ChadFulton commented Sep 23, 2019

Need to do some better handling of low-observation models in plot_diagnostics. These are models that shouldn't really be estimated, and we can't really make the plots work, but we shouldn't raise exceptions.

  • Any dataset with less than 10 observations will raise an error computing the error autocorrelations:
mod = sm.tsa.statespace.SARIMAX(np.random.normal(size=10), order=(10, 
neomatrix369
neomatrix369 commented Jan 13, 2020

Missing functionality

On the back of the issue raised - pandas-profiling/pandas-profiling#315 - I would like to request for improved documentation about removed features and alternative ways to overcome them when using the new version of pandas-profiling, starting v2.4.0.

Proposed feature

  • docs on why style={'full_width': True}, minify_html=True ca
zoj613
zoj613 commented Feb 19, 2020

Description

Calling fit method of Pipeline object throws an expection: UnboundLocalError: local variable 'cloned_transformer' referenced before assignment, when the memory argument is passed an argument. Therfore I am unable to cache any transformers (especially during hyperparameter tuning using a Pipeline object.

Steps/Code to Reproduce

Example:

from imblearn.pipel
omrihar
omrihar commented Jun 29, 2019

I've been trying out knowledge-repo and came across multiple problems that seems to have resulted from following the old documentation on knowledge-repo.readthedocs.io. Looking through the issues it seems that several PRs modified the docs, but the changes didn't propagate yet to the documentation website. It might be a good idea to synchronize the two, so that new users can have a lower entrance

satrum
satrum commented Dec 16, 2019

How can i implement callback parameter in fit moder Autoencoder ?
There is not parameter.

from keras.callbacks.callbacks import EarlyStopping
cb_earlystop = EarlyStopping(monitor='val_loss', min_delta=0, patience=0, verbose=0,
mode='auto', baseline=None, restore_best_weights=False)
pyod_model.fit(scaler, callbacks=[cb_earlystop])

TypeError: fi

jcr-lyxor
jcr-lyxor commented Mar 13, 2019

Following #68 that would be very nice to have extra information in the model. Maybe I could find it but I do see that in the documentation:

  1. Recovering the early stopping epoch number
  2. Recovering the cross validation test and train losses / metrics
  3. Recovering eval_set losses / metrics
  4. Be able to pass a path to a libffm data format
kraxli
kraxli commented Jan 6, 2020

the space between two plots in a facet plot is not large enough for the tick values. Hence the values overlap with the left adjacent figure (please see screenshot).

image

I use the command:
p9.facet_wrap(facets = 'currency', nrow=2, scales='free_y', shrink=True)
with plotnine 0.6 w

rhiever
rhiever commented Jan 31, 2018

I suggest keeping the README short and to the point: badges showing status etc. of the package, what is the purpose of the package, how to install, 1-2 basic code examples, and contributing information.

More examples, function API docs, and detailed descriptions can be moved to the docs site.

I also suggest adding a gallery to the docs, [similar to Seaborn](https://seaborn.pydata.org/example

metasoarous
metasoarous commented Nov 25, 2019

First off, thanks so much for tad; All in all I'm loving it as a light weight CSV viewer!

The only issue I've had so far is that someone passed me a semi-colon delimited file with some decimals, and strangely, these numbers load as ints unless I replace the semi-colons with commas. Any thoughts what might be going on here?

Thanks again!

Improve this page

Add a description, image, and links to the data-analysis topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-analysis topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.