Single Tap
The most basic gesture: tap at specific coordinates.
Using with Bounding Boxes
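Detection models usually report a UI element as a rectangle, so the tap point has to be derived from it. The sketch below computes the center of a box, assuming the box is given as (x_min, y_min, x_max, y_max) in absolute pixels. `bbox_center` is a hypothetical helper and not part of TapKit; a box given as (x, y, width, height) would need a different calculation.

```python
from typing import Tuple


def bbox_center(bbox: Tuple[float, float, float, float]) -> Tuple[int, int]:
    """Return the pixel center of a box.

    Assumption: bbox is (x_min, y_min, x_max, y_max) in absolute pixels.
    """
    x_min, y_min, x_max, y_max = bbox
    if x_max < x_min or y_max < y_min:
        raise ValueError("bbox corners are out of order")
    # Truncate to whole pixels; taps are addressed by integer coordinates here.
    return int((x_min + x_max) / 2), int((y_min + y_max) / 2)


# Hypothetical detection result for a button.
center = bbox_center((100, 200, 300, 260))
```

The resulting tuple can then be used as the point for a tap.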
When working with UI element detection, tap the center of a bounding box.
Double Tap
Double tap for actions like zooming in.
Tap and Hold (Long Press)
Press and hold at a location, which is useful for opening context menus or preparing a drag.

| Parameter | Type | Default | Description |
|---|---|---|---|
| point | tuple/Point | required | Coordinates to tap |
| duration_ms | int | 1000 | Hold duration in milliseconds |
Coordinate Systems
TapKit uses absolute pixel coordinates matching the device’s screen resolution.
Working with Normalized Coordinates
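A minimal sketch of converting normalized coordinates to pixels, assuming values in the 0-1 range with the origin at the top-left corner of the screen. `to_pixels` and the 1080x2400 resolution are illustrative assumptions and not part of TapKit.

```python
from typing import Tuple


def to_pixels(norm_x: float, norm_y: float,
              screen_width: int, screen_height: int) -> Tuple[int, int]:
    """Scale normalized (0-1) coordinates to absolute pixel coordinates."""
    if not (0.0 <= norm_x <= 1.0 and 0.0 <= norm_y <= 1.0):
        raise ValueError("normalized coordinates must lie in [0, 1]")
    # Clamp so that 1.0 maps to the last valid pixel, not one past the edge.
    x = min(int(norm_x * screen_width), screen_width - 1)
    y = min(int(norm_y * screen_height), screen_height - 1)
    return x, y


# Hypothetical model output on an assumed 1080x2400 screen.
point = to_pixels(0.5, 0.25, 1080, 2400)
```

Clamping keeps a coordinate of exactly 1.0 on screen; whether the device accepts the edge pixel is worth checking against the actual resolution.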
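If your vision model returns normalized (0-1) coordinates, multiply them by the screen width and height to get the absolute pixel coordinates TapKit expects.
Return Value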
All tap methods return a Job object.