Is your feature request related to a problem? Please describe.
Some use cases, such as GenAI, could be solved by running external programs. For example, to feed data from Data Prepper into a Python program running LangChain, we could send events to the program and then enrich those events with its output.
Describe the solution you'd like
Create a `pipe` processor. This processor will call an external command with arguments to modify events.
It operates similarly to a Linux pipe (really two pipes). It would write each event as JSON on its own line to the program's `stdin`. Then it would read from the program's `stdout` to get the changes to merge into the event.
Conceptually it is similar to `data-prepper | command | data-prepper`.
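
To make the proposed data flow concrete, here is a minimal Python sketch of both sides of the pipe. It is only an illustration, not Data Prepper code: `run_pipe`, `merge_changes`, `CHILD_SOURCE`, and the `message_length` field are hypothetical names. It also assumes that the command writes exactly one JSON object of changes per input line, in the same order, which this issue does not specify. For simplicity it sends the whole batch at once, whereas a real processor would more likely stream events through a long-running process.

```python
import json
import subprocess
import sys

# Hypothetical external command: reads one JSON event per line from stdin
# and writes one JSON object of changes per line to stdout.
CHILD_SOURCE = """
import json, sys
for line in sys.stdin:
    event = json.loads(line)
    changes = {"message_length": len(event.get("message", ""))}
    print(json.dumps(changes), flush=True)
"""


def merge_changes(event: dict, output_line: str) -> dict:
    """Merge the changes the command wrote for one event into a copy of that event."""
    return {**event, **json.loads(output_line)}


def run_pipe(events: list, command: list) -> list:
    """Write events as newline-delimited JSON to the command's stdin and
    merge each stdout line back into the matching event.

    Assumes one output line per input line, in order (not defined in the proposal).
    """
    payload = "".join(json.dumps(event) + "\n" for event in events)
    result = subprocess.run(
        command, input=payload, capture_output=True, text=True, check=True
    )
    output_lines = result.stdout.splitlines()
    return [merge_changes(event, line) for event, line in zip(events, output_lines)]
```

Calling `run_pipe([{"message": "hello"}], [sys.executable, "-c", CHILD_SOURCE])` would run the child script as the external command and return the event with the `message_length` field merged in. Any program that follows the same line-based convention, such as a LangChain script, could stand in for the child.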
Describe alternatives you've considered (Optional)
The AWS Lambda processor is quite similar. However, it requires the use of AWS Lambda and does not work for fully open-source use cases.
I also considered creating both a Python processor and a LangChain processor, but a general `pipe` processor would cover more use cases. Processors for these specific tools would add little value unless they bundled the tools' dependencies, and bundling them could lead to dependency issues.
Additional context
N/A