Commit

modified default value for scrape type: 4
glorenzo972 committed Sep 14, 2024
1 parent 8509035 commit c36c186
Showing 6 changed files with 13 additions and 5 deletions.
3 changes: 3 additions & 0 deletions .gitignore
@@ -113,6 +113,9 @@ celerybeat.pid
.venv
.environ
.environ.prod
+deploy_old.sh
+docker-compose-test.yml
+docker-compose.yml
env/
venv/
ENV/
5 changes: 5 additions & 0 deletions CHANGELOG.md
@@ -5,6 +5,11 @@
*Andrea Sponziello*
### **Copyrigth**: *Tiledesk SRL*

+## [2024-09-14]
+### 0.2.18
+- upgrade: worker
+- modify: default value for scrape type: 4
+
## [2024-09-05]
### 0.2.17
- fix: nltk download on Dockerfile
2 changes: 1 addition & 1 deletion pyproject.toml
@@ -1,6 +1,6 @@
[tool.poetry]
name = "tilellm"
-version = "0.2.17"
+version = "0.2.18"
description = "tiledesk for RAG"
authors = ["Gianluca Lorenzo <[email protected]>"]
repository = "https://github.com/Tiledesk/tiledesk-llm"
4 changes: 2 additions & 2 deletions tilellm/models/item_model.py
@@ -10,8 +10,8 @@ class ParametersScrapeType4(BaseModel):
tags_to_extract: Optional[List[str]] = Field(default_factory=list)
unwanted_classnames: Optional[List[str]] = Field(default_factory=list)
desired_classnames: Optional[List[str]] = Field(default_factory=list)
-remove_lines: Optional[bool] = Field(default=False)
-remove_comments: Optional[bool] = Field(default=False)
+remove_lines: Optional[bool] = Field(default=True)
+remove_comments: Optional[bool] = Field(default=True)

@model_validator(mode='after')
def check_booleans(cls, values):
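
The hunk above flips two defaults from `False` to `True`, so any caller that omits `remove_lines` or `remove_comments` gets different behaviour after this commit. The sketch below illustrates that effect with a plain dataclass. `ScrapeType4Sketch` is a hypothetical stand-in, not the real Pydantic model, and it leaves out the `check_booleans` validator, whose body is not shown in the diff.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ScrapeType4Sketch:
    """Hypothetical stand-in for ParametersScrapeType4 (assumption: not the real model)."""
    tags_to_extract: List[str] = field(default_factory=list)
    unwanted_classnames: List[str] = field(default_factory=list)
    desired_classnames: List[str] = field(default_factory=list)
    remove_lines: bool = True      # default was False before this commit
    remove_comments: bool = True   # default was False before this commit


# A caller that omits both flags now picks up the new defaults.
implicit = ScrapeType4Sketch(tags_to_extract=["p", "h1"])

# A caller that wants the previous behaviour has to pass the flags explicitly.
explicit = ScrapeType4Sketch(
    tags_to_extract=["p", "h1"],
    remove_lines=False,
    remove_comments=False,
)

print(implicit.remove_lines, implicit.remove_comments)
print(explicit.remove_lines, explicit.remove_comments)
```

Clients that depended on the old defaults would probably need to send both flags explicitly after upgrading to 0.2.18.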
2 changes: 1 addition & 1 deletion tilellm/store/pinecone/pinecone_repository_serverless.py
@@ -135,7 +135,7 @@ async def add_pc_item(self, item, embedding_obj=None, embedding_dimension=None):

else:
metadata = MetadataItem(id=metadata_id, source=source, type=type_source, embedding=embedding)
-document = Document(page_content=content, metadata=metadata.dict())
+document = Document(page_content=content, metadata=metadata.model_dump())  # removed dict()

chunks.extend(self.chunk_data(data=[document], chunk_size=chunk_size, chunk_overlap=chunk_overlap))
total_tokens, cost = self.calc_embedding_cost(chunks, embedding)
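
Pydantic v2 deprecates `.dict()` in favour of `.model_dump()`, which is the substitution made in this hunk. For code that might still run against either major version, one option is a small compatibility helper. The sketch below is an assumption, not part of the commit: `dump_model`, `V2Like`, and `V1Like` are hypothetical names, and the two stub classes only imitate the method names so the example runs without Pydantic installed.

```python
from typing import Any, Dict


def dump_model(model: Any) -> Dict[str, Any]:
    """Hypothetical helper: serialize a Pydantic-style model to a plain dict.

    Prefers the v2 method name and falls back to the v1 one.
    """
    if hasattr(model, "model_dump"):
        return model.model_dump()
    return model.dict()


class V2Like:
    """Stub that exposes only the Pydantic v2 method name."""

    def model_dump(self) -> Dict[str, Any]:
        return {"id": "abc", "api": "v2"}


class V1Like:
    """Stub that exposes only the Pydantic v1 method name."""

    def dict(self) -> Dict[str, Any]:
        return {"id": "abc", "api": "v1"}


v2_result = dump_model(V2Like())
v1_result = dump_model(V1Like())
print(v2_result["api"], v1_result["api"])
```

Since the project already uses `model_validator`, which exists only in Pydantic v2, calling `model_dump()` directly as the commit does is the simpler choice here.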
2 changes: 1 addition & 1 deletion worker/package.json
@@ -9,7 +9,7 @@
"author": "",
"license": "ISC",
"dependencies": {
-"@tiledesk/tiledesk-train-jobworker": "^0.0.23",
+"@tiledesk/tiledesk-train-jobworker": "^0.0.24",
"dotenv": "^16.4.5",
"express": "^4.19.2"
}
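
Under npm's semver rules, a caret range on a `0.0.x` version admits only that exact patch, so `^0.0.23` would never resolve to `0.0.24` and the worker upgrade needs the explicit bump shown above. The sketch below illustrates the rule for plain `major.minor.patch` strings. The function names are hypothetical, and pre-release and build tags are deliberately ignored.

```python
from typing import Tuple

Version = Tuple[int, int, int]


def parse(version: str) -> Version:
    """Parse a plain 'major.minor.patch' string (no pre-release tags)."""
    major, minor, patch = (int(part) for part in version.split("."))
    return major, minor, patch


def caret_upper_bound(base: str) -> Version:
    """Exclusive upper bound of ^base: bump the left-most non-zero component."""
    major, minor, patch = parse(base)
    if major > 0:
        return (major + 1, 0, 0)
    if minor > 0:
        return (0, minor + 1, 0)
    return (0, 0, patch + 1)


def satisfies_caret(candidate: str, base: str) -> bool:
    """True if candidate falls inside the range described by ^base."""
    return parse(base) <= parse(candidate) < caret_upper_bound(base)


print(satisfies_caret("0.0.24", "0.0.23"))  # False: ^0.0.23 admits only 0.0.23
print(satisfies_caret("0.0.24", "0.0.24"))  # True
```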