How to Build a Voice-controlled Virtual Assistant (IVR) in Go with Gin-Gonic and Plivo

A virtual assistant can help your business if you have clients who call your phone number. Interactive voice response (IVR) helps you to automate call reception by routing callers to the most appropriate department or the agent most qualified to meet their needs. Among its many advantages, IVR can provide increased operational efficiency, a stronger brand image, and better customer insights.

A voice-controlled virtual assistant goes a step beyond a legacy Touch-Tone (DTMF) menu because of the flexibility it gives callers: instead of pressing keys, they can simply speak into their phone’s microphone to control the call.

Building a voice-controlled virtual assistant in Go with the Gin-Gonic framework and Plivo’s automatic speech recognition (ASR) feature is simple. This guide shows you how to set up a voice-controlled IVR phone tree on a Plivo number and manage the call flow when the call reaches the Plivo voice platform. We’ll build a Gin application that receives an incoming call and uses the GetInput XML element to capture speech input and implement a simple IVR phone system.

Prerequisites

Before you get started, you’ll need:

  • A Plivo account — sign up for one for free if you don’t have one already.
  • A voice-enabled Plivo phone number if you want to receive incoming calls. To search for and buy a number, go to Phone Numbers > Buy Numbers on the Plivo console.
  • Go, plus the Gin-Gonic and Plivo Go packages.
  • ngrok — a utility that exposes your local development server to the internet over secure tunnels.

How it works

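When a caller dials your Plivo number, Plivo requests the Answer URL configured for the application assigned to that number. The application returns an XML document whose GetInput element plays a prompt and listens for speech. Plivo then sends the recognized speech to the action URL, and the application responds with XML that connects the caller to the matching department.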

Create a Gin web application to create a voice-controlled virtual assistant

Once you’ve installed Go and the Gin-Gonic and Plivo Go packages, create a simple Gin web application to handle incoming calls on a Plivo number. To handle an incoming call, you need to return an XML document from the URL configured as the Answer URL in the application assigned to the Plivo number. The Go SDK can generate the XML document for you, and the GetInput XML element captures speech input so that you can implement a simple IVR phone system. Use this code:

package main

import (
	"github.com/gin-gonic/gin"
	"github.com/plivo/plivo-go/xml"
)

func main() {
	r := gin.Default()
	// Welcome message played on the first branch of the phone tree
	WelcomeMessage := "Welcome to the demo app. Say Sales to talk to our Sales representative. Say Support to talk to our Support representative."
	// Message Plivo reads when the caller says nothing at all
	NoInput := "Sorry, I didn't catch that. Please hang up and try again later."
	// Message Plivo reads when the caller's speech matches neither option
	WrongInput := "Sorry, that's not a valid option."
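	// Plivo requests the Answer URL with the HTTP method configured on the
	// application (POST in this guide), while a browser test uses GET, so
	// accept any method on this route.
	r.Any("/virtual_assistant/", func(c *gin.Context) {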
		c.Header("Content-Type", "application/xml")
		response := xml.ResponseElement{
			Contents: []interface{}{
				new(xml.GetInputElement).
					SetAction("https://" + c.Request.Host + "/virtual_assistant/firstbranch/").
					SetMethod("POST").
					SetInputType("speech").
					SetInterimSpeechResultsCallback("https://" + c.Request.Host + "/virtual_assistant/firstbranch/").
					SetInterimSpeechResultsCallbackMethod("POST").
					SetRedirect(true).
					SetContents([]interface{}{
						new(xml.SpeakElement).
							AddSpeak(WelcomeMessage, "WOMAN", "en-US", 1),
					}),
				new(xml.SpeakElement).
					AddSpeak(NoInput, "WOMAN", "en-US", 1),
			},
		}
		c.String(200, response.String())
	})
	// GetInput sends its result here with POST (see SetMethod above), so
	// register the route for POST and read the fields from the form body.
	r.POST("/virtual_assistant/firstbranch/", func(c *gin.Context) {
		c.Header("Content-Type", "application/xml")
		speech := c.PostForm("Speech")
		fromnumber := c.PostForm("From")
		if speech == "sales" {
			c.XML(200, xml.ResponseElement{
				Contents: []interface{}{
					new(xml.DialElement).
						SetCallerID(fromnumber).
						SetContents([]interface{}{
							new(xml.NumberElement).
								SetContents("+14156667777"),
						}),
				},
			})
		} else if speech == "support" {
			c.XML(200, xml.ResponseElement{
				Contents: []interface{}{
					new(xml.DialElement).
						SetCallerID(fromnumber).
						SetContents([]interface{}{
							new(xml.NumberElement).
								SetContents("+14156667778"),
						}),
				},
			})
		} else {
			response := xml.ResponseElement{
				Contents: []interface{}{
					new(xml.SpeakElement).
						AddSpeak(WrongInput, "WOMAN", "en-US", 1),
				},
			}
			c.String(200, response.String())
		}
	})
	r.Run() // listen and serve on 0.0.0.0:8080 (for windows "localhost:8080")
}
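
The handler above compares the Speech value against exact lowercase strings. This guide does not establish what casing or punctuation the recognizer returns, so a more forgiving match may be worth considering. The following is a minimal sketch, not part of the Plivo SDK: the names departmentNumbers, normalizeSpeech, and lookupDepartment are hypothetical, and the phone numbers are the placeholders used in the code above.

```go
package main

import (
	"fmt"
	"strings"
)

// departmentNumbers maps a spoken keyword to the number dialed for it.
// The numbers are the placeholders from the guide's example code.
var departmentNumbers = map[string]string{
	"sales":   "+14156667777",
	"support": "+14156667778",
}

// normalizeSpeech lowercases a transcript and strips surrounding
// whitespace and common sentence punctuation.
func normalizeSpeech(speech string) string {
	s := strings.ToLower(strings.TrimSpace(speech))
	return strings.Trim(s, ".,!? ")
}

// lookupDepartment returns the number to dial for a transcript and
// false when the transcript matches no known keyword.
func lookupDepartment(speech string) (string, bool) {
	number, ok := departmentNumbers[normalizeSpeech(speech)]
	return number, ok
}

func main() {
	for _, transcript := range []string{"Sales", " support. ", "billing"} {
		if number, ok := lookupDepartment(transcript); ok {
			fmt.Printf("%q -> dial %s\n", transcript, number)
		} else {
			fmt.Printf("%q -> play the wrong-input message\n", transcript)
		}
	}
}
```

With a lookup like this, the two Dial branches in the handler could collapse into one, and adding a department becomes a one-line change to the map.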

Test the code locally

Save this code in a file (for example, virtual_assistant.go). To run it, go to the folder that contains the file and use the following command:

$ go run virtual_assistant.go

You should see your basic server app in action at http://localhost:8080/virtual_assistant/.

Expose the local server to the internet using ngrok

Once you see the application working locally, the next step is to expose it to the internet so that Plivo can fetch the XML document that processes the incoming call. For that, we recommend using ngrok, which exposes local servers behind NATs and firewalls to the public internet over secure tunnels. Install it and run ngrok on the command line, specifying the port on which the application listens (8080 in this case, as our local Gin web application is running there):

$ ./ngrok http 8080

Ngrok will display a forwarding link that you can use as a webhook to access your local server over the public network.

Test the link by opening the ngrok URL (https://59d9-49-206-115-115.ngrok.io/virtual_assistant/) in a browser, or by using HTTPie to check the XML response from the ngrok URL.

Connect the Gin web application to a Plivo number

The final step is to configure the application as a Plivo voice application and assign it to a Plivo number on which you want to activate the voice-controlled virtual assistant.

Go to the Plivo console and navigate to Voice > Applications > XML, then click on the Add New Application button in the upper right.

Provide a friendly name for the application — we used “App-Virtual-Assistant” — and configure the ngrok URL https://59d9-49-206-115-115.ngrok.io/virtual_assistant/ as the Answer URL. Select the HTTP verb as POST, then click Create Application.

Now go to Phone Numbers > Your Numbers and click on the number to which you want to assign the application. From the Plivo Application drop-down, choose the voice application you just created. Finally, click Update Number.

Test the application

Make a phone call to the Plivo number you selected. You should see that the virtual assistant Gin application automatically routes the call to the Sales or Support department based on the speech input received on the call.

And that’s how simple it is to set up a voice-controlled virtual assistant on a Plivo number and handle it with XML documents generated by Plivo’s Go SDK in a Gin application. You can implement other use cases on the Plivo Voice platform, such as phone system IVR, call forwarding, and number masking, as your business requires.

Haven’t tried Plivo yet? Getting started is easy and only takes five minutes. Sign up today.
