We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
appAgent的效果极大程度上依赖于LLM返回的结果准确度,我用了gpt4-vision-preview的,给他一个输入手机号的界面,这个时候预期的结果应该是点击(tap)输入手机号的输入框,而不是直接text手机号码,我测试了gpt4v,qwen2max,glm4v9b还有豆包,都返回的是text手机号码,总感觉差一点点。
难道说其实现在LLM对于agent的能力还是不够好,或者说得单独训练这反面的能力吗,如果是的话该如何设置数据集呢?就像代码中的promote这样设计吗,输入是一大段,输出是四段式?
The text was updated successfully, but these errors were encountered:
确实一般,可能appagentV2会好一点吧
Sorry, something went wrong.
兄弟你试的感觉怎么样,用gpt4o感觉如何?
有工程bug,执行时间略慢。期待V2版本
No branches or pull requests
appAgent的效果极大程度上依赖于LLM返回的结果准确度,我用了gpt4-vision-preview的,给他一个输入手机号的界面,这个时候预期的结果应该是点击(tap)输入手机号的输入框,而不是直接text手机号码,我测试了gpt4v,qwen2max,glm4v9b还有豆包,都返回的是text手机号码,总感觉差一点点。
难道说其实现在LLM对于agent的能力还是不够好,或者说得单独训练这反面的能力吗,如果是的话该如何设置数据集呢?就像代码中的promote这样设计吗,输入是一大段,输出是四段式?
The text was updated successfully, but these errors were encountered: