Replies: 2 comments 4 replies
-
@tomtom215 Thanks for your interest! I just read the document and I am wondering how Apache SeaTunnel avoids duplication? For example, if the file is hosted in a simple Nginx server and RisingWave might pulls it for multiple times, and every time all the data will be sent to downstream. For comparison, RisingWave's S3 file source will save a list of On the other hand, we had once discussed an idea of "POST HTTP Source": it accepts |
Beta Was this translation helpful? Give feedback.
-
Hey @fuyufjh thanks for the response! As far as I can tell from the SeaTunnel documentation, they currently do not support exactly-once delivery for the HTTP source connector, but it seems that they avoid duplication from their file or kafka source by saving a read & split offset as a "StateSnapshot" I haven't found any roadmaps or discussions about including that for SeaTunnel's HTTP source yet, but I would assume it may be possible via something like a message hash + timestamp stored as a StateSnapshot, but that is just a quick idea from the top of my head. For our use-case, I believe it would align directly with your idea of POST HTTP source that accepts POST messages and injects/inserts that data to a SINK destination. For more context, we have legacy systems and applications that push DB updates via JSON messages. It's close to CDC or pub/sub, but since we don't have influence over the source systems to setup something like Debezium that helps with handling the integration with other destinations, we are looking for something like an HTTP source connector -> RisingWave -> DataWarehouse or Message Queue destination |
Beta Was this translation helpful? Give feedback.
-
Hello,
Long time lurker on this awesome project :)
I was curious about the possibility of having an HTTP(S) source because that would allow us to avoid creating some middleware to convert incoming HTTP json messages via a webhook endpoint to AMPQ messages to be put in a queue or a direct DB insert function.
Ideally, we would love if RisingWave offered this connector in the same way that Apache SeaTunnel does here https://seatunnel.apache.org/docs/2.3.2/connector-v2/source/Http
Beta Was this translation helpful? Give feedback.
All reactions