How do Voice And Video Call Work?
- Authors
- Name
- Amit Shekhar
- Published on
I am Amit Shekhar, Co-Founder @ Outcome School, I have taught and mentored many developers, and their efforts landed them high-paying tech jobs, helped many tech companies in solving their unique problems, and created many open-source libraries being used by top companies. I am passionate about sharing knowledge through open-source, blogs, and videos.
Join Outcome School and get high paying tech job: Outcome School
Before we start, I would like to mention that, I have released a video playlist to help you crack the Android Interview: Check out Android Interview Questions and Answers.
This blog is all about how voice and video call works on a high level.
Voice over Internet Protocol (VoIP) is one of the most popular standards for voice and video calling over the web.
We all use voice and video on various platforms like WhatsApp, Skype, Messenger, Facebook, etc.
Both voice and video calls depend on how we stream media between the two clients which are connected to each other. So, there must be something that can do the work of media streaming from one client to another client.
For media streaming, we need to know about WebRTC.
WebRTC is a free, open project that provides browsers and mobile applications with Real-Time Communications (RTC) capabilities via simple APIs. The WebRTC components have been optimized to best serve this purpose.
But there are many other things that we need to do as WebRTC is not enough for complete implementation.
Other things are:
- Signaling
- STUN Server
- TURN Server
Signaling
What is Signaling?
In order to set up a call between two clients, both clients must conform to each other by sending key data, messages, and metadata about the media. Over signaling, we do these things.
We can use WebSocket for the purpose of signaling.
Signaling is just used to know that these two clients want to connect to each other for the call.
Peer-to-Peer Connection
After signaling, we need to connect both the client peer to peer. And for connecting, we must have the public IP address of both clients.
So, in order to get the public IP address, we use the STUN Server.
STUN Server
STUN Server is used to get the public IP address.
Why do we need a public IP address?
A Public IP Address is an IP address that is globally unique across the Internet. Only one device may be in possession of a public IP address.
A Private IP Address is an IP address that is not globally unique and may exist simultaneously on many different devices. A private IP address is never directly connected to the Internet.
The NAT(Network Address Translation) provides the local IP address of the device which can't be used publicly to connect peer to peer. And for WebRTC, we need to have a public IP address. STUN Server provides that.
If everything is fine, we get the public IP addresses of both clients, and then, we connect both clients through WebRTC to start the call. WebRTC handles all the media streaming.
The real-world connectivity is not ideal.
In case, we are not able to get the public IP address of both clients. Then we can't connect peer to peer. In that case, we need the TURN Server.
TURN Server
TURN Server is used to connect both clients if peer-to-peer fails by acting as a mediator. Basically, it takes the data from one client and sends it to another client. So, its job is to relay the media.
This way, the two clients start talking to each other.
The other small data which are not related to media like a client cuts the call, any setting changes, messages and etc are sent over the signaling process.
The following question arises.
Why WebRTC can't do signaling?
Answer: To avoid redundancy and to maximize compatibility with established technologies, the signaling methods and protocols are not specified by the WebRTC Standards.
WebRTC is optimized for media.
So, this way the voice and video call works.
Prepare yourself for Android Interview: Android Interview Questions
That's it for now.
Thanks
Amit Shekhar
Co-Founder @ Outcome School
You can connect with me on:
Follow Outcome School on: