Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Intermittent issue calling ML endpoint. #17

Open
mazer-rakham opened this issue Dec 16, 2024 · 3 comments
Open

Intermittent issue calling ML endpoint. #17

mazer-rakham opened this issue Dec 16, 2024 · 3 comments
Assignees

Comments

@mazer-rakham
Copy link

I think I have narrowed it down to token usage, just trying to get a confirmation. I have deployed successfully to the cloud, can ask questions to the Vector Search only perfectly fine, Semantic Ranker works, but only if you give it time before hitting it again, the Graph Rag + Semantic Ranker I have not gotten to work, it fails every time. Is there a estimation token usage somewhere for me to reference in able to troubleshoot better?
image
image

The error from App Insights is a 404 when it is traced:
image

@mazer-rakham
Copy link
Author

Have found that a hard refresh in the browser solves all issues.

@serdarmicrosoft
Copy link
Contributor

Thanks for reporting the issue @mazer-rakham . Has it been completely resolved after performing a hard refresh on your browser? While there is a 16K token limit for the Azure OpenAI model, I suspect your issue might be unrelated.

@serdarmicrosoft serdarmicrosoft self-assigned this Dec 17, 2024
@mazer-rakham
Copy link
Author

Yeah, if you hard refresh the browser it works on each of the 3 designs, so do vector only, hard refresh, semantic only, hard refresh, then combination, hard refresh and it works, if you try and do one, than the other, then it [object, Object] with the weird 404 ml endpoint in App Insights. Also I had to do alot of modification for it to deploy the postgres to the cloud and seed it correctly, I had to do a lot of py import changing and added a new setup script

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants