I mean.. as opposed to two numbers? A slow speed high error correction audio encoding of them would be easy. Call emergency services, press the button, they decode the tones, immediate, accurate, simple.
If you mixed the calling and playback into a single action, you could make calls and direct people without even the ability to speak.