GitHub - OpenDFM/mobile-env-expe: Experiment codes of Mobile-Env paper.

Experiment codes for paper Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction.

launch.sh is the experiment launcher with text LLMs and launch_mm.sh is the launcher with MLMs. To launch the program, Mobile-Env environment v4.0b1 should be set up. WikiHow task set v1.3 is used.

We use vision-ui for Set-of-Marks. The model weights used by vision-ui can be downloaded according to https://github.com/Meituan-Dianping/vision-ui/blob/master/resources/vision_infer.md. After downloading, place it under meituan_weights folder.

Citation

@article{DanyangZhang2023_MobileEnv,
  title     = {{Mobile-Env}: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction},
  author    = {Danyang Zhang and
               Zhennan Shen and
               Rui Xie and
               Situo Zhang and
               Tianbao Xie and
               Zihan Zhao and
               Siyuan Chen and
               Lu Chen and
               Hongshen Xu and
               Ruisheng Cao and
               Kai Yu},
  journal   = {CoRR},
  volume    = {abs/2305.08144},
  year      = {2023},
  url       = {https://arxiv.org/abs/2305.08144},
  eprinttype = {arXiv},
  eprint    = {2305.08144},
}

Name		Name	Last commit message	Last commit date
Latest commit History 166 Commits
branch-config		branch-config
prompts		prompts
utils		utils
weights/vilt-b32-mlm-tiny-tkn		weights/vilt-b32-mlm-tiny-tkn
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
agent.py		agent.py
agent_mm.py		agent_mm.py
branch_flag		branch_flag
launch.sh		launch.sh
launch_mm.sh		launch_mm.sh
llm_accessor.py		llm_accessor.py
main.py		main.py
main_mm.py		main_mm.py
openaiconfig.yaml		openaiconfig.yaml
requirements.txt		requirements.txt
vh_to_html.py		vh_to_html.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Citation

About

Releases

Packages

Languages

License

OpenDFM/mobile-env-expe

Folders and files

Latest commit

History

Repository files navigation

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages