data-pipelines
Here are 97 public repositories matching this topic...
🚨 🚨 Feature Request
- A new implementation (Improvement, Extension)
Is your feature request related to a problem?
Currently, if a user tries to access an index that is larger than the dataset length or tensor length, an internal error is thrown which is not easy to understand.
Description of the possible solution
We can catch the error and throw a more descriptive e
Is your feature request related to a problem? Please describe.
It's cumbersome to create the same step twice.
Describe the solution you'd like
Add a button to duplicate a step in the pipeline editor.
Ideas
We could combine this with some other ideas in a context menu (right click).
Credit to Serhii Ostapchuk for contributing this on Slack.
-
Updated
May 3, 2022 - Scala
Sending a rest call to delete a job specification throws 404 where as grpc call works fine. Steps to reproduce
curl -X DELETE "http://localhost:9100/v1/project/my-project/namespace/kush/helloworld" -H "accept: application/json"Support copy into queries
-
Updated
May 5, 2022 - TypeScript
Describe the bug
If user was selected as data entity owner he should be excluded from subsequent ownership selections
-
Updated
May 5, 2022 - Python
-
Updated
May 5, 2022 - Python
What is the feature request? What problem does it solve?
As employees leave the organization/company or users change mails , eventually the notification list configured for the job would start containing a lot of invalid mails. This causes issues with SMTP relay (e.g postfix) which could be buffering all invalid requests until the queu is full, which cause all mails coming for all jobs to b
-
Updated
May 5, 2022 - PHP
-
Updated
May 5, 2022 - Python
-
Updated
Nov 22, 2021 - Python
-
Updated
May 5, 2022 - Python
Tools/Apache Spark
https://github.com/JPHaus/data-engineering-wiki/blob/main/Tools/Apache%20Spark.md
This note is a seedling and is a great place to make your first contribution!
Concepts/Full Load
Is your feature request related to a problem? Please describe.
Executing all tests takes already about 30mins. We should try to optimize that.
Describe the solution you'd like
Much time is taken by preparing input data by writing test data to DataObjects (Csv or Hive). This could be significantly reduced by creating a custom DataObject where a DataFrame can be set as input data, which
-
Updated
Feb 18, 2022 - Go
-
Updated
Dec 15, 2017 - Java
-
Updated
Jun 6, 2021 - Jupyter Notebook
-
Updated
Mar 29, 2022 - Python
-
Updated
Aug 30, 2017 - Python
-
Updated
May 2, 2022 - Python
-
Updated
Aug 4, 2021 - Python
-
Updated
Nov 10, 2019 - Python
-
Updated
Aug 2, 2017 - Python
-
Updated
Mar 4, 2021 - Jupyter Notebook
-
Updated
May 4, 2022 - Python
-
Updated
Nov 15, 2021 - Python
Improve this page
Add a description, image, and links to the data-pipelines topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-pipelines topic, visit your repo's landing page and select "manage topics."

