Name	Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows	.github/workflows
docs	docs
examples	examples
midscene	midscene
scripts	scripts
tests	tests
wiki	wiki
.env.example	.env.example
.gitignore	.gitignore
LICENSE	LICENSE
Makefile	Makefile
README.md	README.md
midscene.yml	midscene.yml
pyproject.toml	pyproject.toml
requirements.txt	requirements.txt

Name

Last commit message

Last commit date

12 Commits

Midscene Python

Midscene Python 是一个基于 AI 的自动化框架，支持 Web 和 Android 平台的 UI 自动化操作。

概述

Midscene Python 提供全面的 UI 自动化能力，具有以下核心特性：

自然语言驱动：使用自然语言描述自动化任务
多平台支持：支持 Web（Selenium/Playwright）和 Android（ADB）
AI 模型集成：支持 GPT-4V、Qwen2.5-VL、Gemini 等多种视觉语言模型
可视化调试：提供详细的执行报告和调试信息
缓存机制：智能缓存提升执行效率

项目架构

midscene-python/
├── midscene/                    # 核心框架
│   ├── core/                    # 核心框架
│   │   ├── agent/              # Agent系统
│   │   ├── insight/            # AI推理引擎
│   │   ├── ai_model/           # AI模型集成
│   │   ├── yaml/               # YAML脚本执行器
│   │   └── types.py            # 核心类型定义
│   ├── web/                     # Web集成
│   │   ├── selenium/           # Selenium集成
│   │   ├── playwright/         # Playwright集成
│   │   └── bridge/             # Bridge模式
│   ├── android/                 # Android集成
│   │   ├── device.py           # 设备管理
│   │   └── agent.py            # Android Agent
│   ├── cli/                     # 命令行工具
│   ├── mcp/                     # MCP协议支持
│   ├── shared/                 # 共享工具
│   └── visualizer/             # 可视化报告
├── examples/                   # 示例代码
├── tests/                      # 测试用例
└── docs/                       # 文档

技术栈

Python 3.9+：核心运行环境
Pydantic：数据验证和序列化
Selenium/Playwright：Web 自动化
OpenCV/Pillow：图像处理
HTTPX/AIOHTTP：HTTP 客户端
Typer：CLI 框架
Loguru：日志记录

快速开始

安装

pip install midscene-python

基础用法

from midscene import Agent
from midscene.web import SeleniumWebPage

# 创建 Web Agent
with SeleniumWebPage.create() as page:
    agent = Agent(page)
    
    # 使用自然语言进行自动化操作
    await agent.ai_action("点击登录按钮")
    await agent.ai_action("输入用户名 '[email protected]'")
    await agent.ai_action("输入密码 'password123'")
    await agent.ai_action("点击提交按钮")
    
    # 数据提取
    user_info = await agent.ai_extract("提取用户个人信息")
    
    # 断言验证
    await agent.ai_assert("页面显示欢迎信息")

主要特性

🤖 AI 驱动的自动化

使用自然语言描述操作，AI 自动理解并执行：

await agent.ai_action("在搜索框中输入'Python教程'并搜索")

🔍 智能元素定位

支持多种定位策略，自动选择最优方案：

element = await agent.ai_locate("登录按钮")

📊 数据提取

从页面提取结构化数据：

products = await agent.ai_extract({
    "products": [
        {"name": "产品名称", "price": "价格", "rating": "评分"}
    ]
})

✅ 智能断言

AI 理解页面状态，进行智能断言：

await agent.ai_assert("用户已成功登录")

许可证

MIT License

About

Midscene Python 是一个基于AI的UI自动化测试框架，支持Web和Android平台。其核心创新在于使用自然语言驱动自动化任务，通过集成视觉语言模型（如GPT-4V、Qwen2.5-VL、Gemini）来理解用户意图并执行操作.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Midscene Python

概述

项目架构

技术栈

快速开始

安装

基础用法

主要特性

🤖 AI 驱动的自动化

🔍 智能元素定位

📊 数据提取

✅ 智能断言

许可证

About

Uh oh!

Releases 2

Packages

Languages

License

Python51888/Midscene-Python

Folders and files

Latest commit

History

Repository files navigation

Midscene Python

概述

项目架构

技术栈

快速开始

安装

基础用法

主要特性

🤖 AI 驱动的自动化

🔍 智能元素定位

📊 数据提取

✅ 智能断言

许可证

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages