Enum class for Dialog UX state
Neither SPEAKING nor RECOGNIZING (LISTENING, EXPECTING, THINKING)
Waiting to start speech for speech recognition
Listening for speech recognition
Processing speech to get result(text)
TTS Speaking