[Webcrawler] Avoid null bytes in tags
Description
---
Fixes issue from [thread](https://dust4ai.slack.com/archives/C05F84CFP0E/p1732793680391139)

(in addition to PR #8869)

Risks
---
N/A

Deploy
---
connectors
philipperolet committed Nov 28, 2024
1 parent 6889dab commit 4a22e39
Showing 1 changed file with 1 addition and 1 deletion.
@@ -327,7 +327,7 @@ export async function crawlWebsiteByConnectorId(connectorId: ModelId) {
 }),
 documentUrl: request.url,
 timestampMs: new Date().getTime(),
-tags: [`title:${pageTitle}`],
+tags: [`title:${stripNullBytes(pageTitle)}`],
 parents: getParentsForPage(request.url, false),
 upsertContext: {
 sync_type: "batch",

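For readers without the rest of the codebase at hand, here is a minimal sketch of what a helper like `stripNullBytes` might do. The body shown is an assumption, since this commit only shows the call site, and `buildTitleTag` is a hypothetical wrapper added purely for illustration. The likely motivation, also an assumption, is that a crawled page title containing a `U+0000` character can be rejected by downstream storage (PostgreSQL, for example, does not accept null bytes in text values).

```typescript
// Hypothetical sketch: the real stripNullBytes in the connectors codebase may differ.
function stripNullBytes(text: string): string {
  // Remove every U+0000 character; all other characters are left untouched.
  return text.replace(/\u0000/g, "");
}

// Hypothetical helper mirroring the changed line: build the title tag
// from a crawled page title after removing null bytes.
function buildTitleTag(pageTitle: string): string {
  return `title:${stripNullBytes(pageTitle)}`;
}

// A title containing a null byte yields a clean tag.
const tags: string[] = [buildTitleTag("Hello\u0000 World")];
console.log(tags[0] === "title:Hello World");
```

Sanitizing at the point where the tag is built keeps the rest of the upsert payload unchanged, which matches the one-line scope of this commit.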