Audio-to-text transcription via x402 micropayment at $0.003 per request.
What it does
This x402-enabled endpoint at convert.kasandell.workers.dev/convert/audio provides audio-to-text transcription. You POST an audio file and receive a JSON object containing the transcribed text. The service accepts common audio formats including MP3 (audio/mpeg), WAV, M4A, OGG, FLAC, and WebM.
Pricing is $0.003 per request, settled via the x402 protocol. The endpoint returned a 402 Payment Required challenge on probe, confirming it is live and operational. The service is hosted on Cloudflare Workers.
Documentation is minimal — the only available information comes from the root JSON descriptor served at the origin. There is no OpenAPI spec, no dedicated docs page, and no additional endpoints beyond the single audio conversion route. The response schema is simple: a JSON object with a single `text` string field containing the transcription result.
Capabilities
Use cases
- —Transcribing audio recordings to text on a per-request micropayment basis
- —Adding speech-to-text capability to an AI agent pipeline without subscription commitments
- —Converting podcast or meeting audio clips into searchable text
Fit
Best for
- —Low-volume or on-demand audio transcription without API key signup
- —Agent workflows that need pay-per-call speech-to-text
- —Developers prototyping transcription features with minimal overhead
Not for
- —High-volume batch transcription (no documented bulk or streaming support)
- —Real-time live audio streaming transcription
- —Non-audio file conversions (only audio-to-text is supported)
Quick start
curl -X POST https://convert.kasandell.workers.dev/convert/audio \
-H "Content-Type: audio/mpeg" \
-H "X-Payment: <x402-payment-header>" \
--data-binary @recording.mp3Example
Response
{
"text": "Hello, this is a sample transcription of the uploaded audio file."
}Endpoint
Quality
The endpoint is confirmed live (402 challenge captured) and the root JSON descriptor provides basic info on accepted formats, pricing, and response shape. However, there is no OpenAPI spec, no documentation pages, no x402 challenge details (the challenge object was empty), and no usage examples. Most fields are inferred from the sparse root descriptor.
Warnings
- —No OpenAPI or formal API documentation available
- —The x402 challenge object was empty — payment parameters (network, token, amount in base units) could not be verified from the probe
- —Price of $0.003 is taken from the provider's self-reported root descriptor and not independently confirmed via challenge fields
- —No information on file size limits, supported languages, or transcription model used
Citations
- —Endpoint accepts audio/mpeg, audio/wav, audio/x-m4a, audio/ogg, audio/flac, audio/webm and returns { text: string }https://convert.kasandell.workers.dev
- —Price is $0.003 per requesthttps://convert.kasandell.workers.dev
- —Endpoint returned HTTP 402 on POST, confirming it is livehttps://convert.kasandell.workers.dev/convert/audio