big-data

Now insert and query share the resource ( Max Process Count control) 。 When the query with high TPS，the insert will get error (“error: too many process”). I think separator the resource for Insert and Query will makes sense. Ensure enough resource for insert。It looks like Use Yarn， Insert and Query use the different resource quota。
Or the simple way , Can we set Ratio for Insert and

The latest copy of the CPython grammar tests in test_grammar.py has several @skips and FIXMEs. Some of them seem easy to fix, e.g. some parser bugs or missing warnings that would be helpful, others are entire features. We should fix the easy ones and make sure there are tickets for the rest.

Hi!

There is no opportunity to get fitted models after cv, because catboost.cv returns only evaluation metric scores. At the same time popular ml libraries have such option in some kind.

For LightGBM there is optional argument return_cvbooster:

cv = lgb.cv(params, X_train, show_stdv=False, stratified=True

Lots of white space on either side, can that be reduced
Make it so all count columns, except the far right maybe could be hidden OR just allow any column to be hidden will menu per column
Would be nice to be able to resize columns
Support adding 1 or 2 more sub fields (once i had the columns there will be room! :) )

We have added default serializers to the 4.2 series here:
hazelcast/hazelcast#17934

There is a problem with backward compatibility. If a user had CustomSerializer for optional in 4.1, in 4.2 there is no way to use their serializers, and Hazelcast will throw java.lang.IllegalArgumentException: [class java.util.Optional] serializer cannot be overridden

Users can basi

PrestoDB https://prestodb.io .. is widely used as SQL frontend for many different data-sources, including ElasticSearch, and even files in S3 .. would be very nice if there would be a Connector available for Vespa.

Hi, if my spark app is using 2 storage type, both S3 and Azure Data Lake Store Gen2, could I put spark.delta.logStore.class=org.apache.spark.sql.delta.storage.AzureLogStore, org.apache.spark.sql.delta.storage.S3SingleDriverLogStore

Thanks in advance

Dec	JAN	Feb
	01
2020	2021	2022

big-data

Here are 2,290 public repositories matching this topic...

apache / spark

binhnguyennus / awesome-scalability

donnemartin / data-science-ipython-notebooks

explosion / spaCy

apache / flink

ClickHouse / ClickHouse

apache / predictionio

amark / gun

prestodb / presto

yahoo / CMAK

heibaiying / BigData-Notes

apache / storm

cython / cython

catboost / catboost

h2oai / h2o-3

apache / zeppelin

pachyderm / pachyderm

apache / couchdb

arkime / arkime

apache / beam

tschellenbach / Stream-Framework

hazelcast / hazelcast

intel-analytics / BigDL

apache / ignite

apache / hive

vespa-engine / vespa

delta-io / delta

TuiQiao / CBoard

linkedin / datahub

databricks / koalas

Improve this page

Add this topic to your repo