Skip to main content
Phones are the physical iPhones connected to your TapKit account through the Mac app. The API lets you list available phones, get device information, and perform actions on them.

How phones work in the API

Every phone connected through the TapKit Mac app becomes available via the API. Each phone has a unique phone_id that you use to target actions. If you only have one phone connected, most endpoints auto-select it — you don’t need to specify a phone_id.

Available actions

You can perform these actions on a connected phone:

Touch gestures

ActionEndpointDescription
TapPOST /phones/{id}/tapSingle tap at coordinates
Double tapPOST /phones/{id}/double-tapDouble tap for zoom or text selection
Tap and holdPOST /phones/{id}/tap-and-holdLong press for context menus
FlickPOST /phones/{id}/flickFast swipe gesture
PanPOST /phones/{id}/panSlow drag gesture
DragPOST /phones/{id}/dragDrag between two points
Hold and dragPOST /phones/{id}/hold-and-dragLong press then drag
PinchPOST /phones/{id}/pinchPinch to zoom in/out

Device control

ActionEndpointDescription
HomePOST /phones/{id}/homeGo to home screen
LockPOST /phones/{id}/lockLock the device
UnlockPOST /phones/{id}/unlockUnlock the device
VolumePOST /phones/{id}/volumeAdjust volume up/down
RotatePOST /phones/{id}/rotateRotate screen orientation
SpotlightPOST /phones/{id}/spotlightOpen Spotlight search
SiriPOST /phones/{id}/siriActivate Siri

App control

ActionEndpointDescription
Open appPOST /phones/{id}/open-appOpen any app by name or bundle ID
Type textPOST /phones/{id}/typeType into the focused text field
ScreenshotPOST /phones/{id}/screenshotCapture the current screen
Run shortcutPOST /phones/{id}/shortcutRun an iOS Shortcut

Coordinate system

All touch actions use pixel coordinates that map 1:1 with screenshot pixels. Screenshots are scaled so the longest edge is 1344px. The API handles native-to-scaled coordinate conversion transparently. For example, if a screenshot shows a button at position (300, 672), you send {"x": 300, "y": 672} to the tap endpoint.

Selector-based actions

Many touch actions also support a selector variant (e.g., tap-select, drag-select) that lets you target elements by description rather than coordinates. This is useful when you know what you want to tap but not exactly where it is. See the individual endpoint pages for details on selector parameters.

Device management

EndpointDescription
GET /phonesList all connected phones
GET /phones/{id}Get phone info (name, screen size, status)
GET /phones/{id}/settingsGet phone settings
PUT /phones/{id}/settingsUpdate phone settings