Pinch by Description
Pinch by Description
Pinch/zoom on an element using natural language description
POST
Pinch by Description
Pinch, zoom, or rotate on an element described in natural language. Uses vision AI to find the element and perform the gesture.Documentation Index
Fetch the complete documentation index at: https://docs.tapkit.ai/llms.txt
Use this file to discover all available pages before exploring further.
Request
Path Parameters
| Parameter | Type | Description |
|---|---|---|
phone_id | string | The phone identifier |
Query Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
async | boolean | false | Return immediately with job ID |
Request Body
| Field | Type | Default | Description |
|---|---|---|---|
selector | string | required | Natural language description of the element |
action | string | required | "pinch_in", "pinch_out", "rotate_cw", or "rotate_ccw" |
duration_ms | integer | 1000 | Gesture duration in milliseconds |
Actions
| Action | Effect |
|---|---|
pinch_out | Zoom in (fingers apart) |
pinch_in | Zoom out (fingers together) |
rotate_cw | Rotate clockwise |
rotate_ccw | Rotate counter-clockwise |
Response
Synchronous
Asynchronous
Examples
Zoom In on a Map
SDK Usage
The Python SDK provides this through thepinch() method with a string argument:
Related Endpoints
- Pinch - Pinch at specific coordinates
- Double Tap - Alternative zoom method