Audio buffer fix #47

mainvolume · 2018-06-06T20:47:30Z

Audio buffer and format fix.

1.0.0

Added buffer size variable Minor fixes

BrunoBerisso · 2018-06-07T08:48:13Z

Hey! @mainvolume Thank you very much for this PR :)

Could you please change the base so your changes are merged into the development branch? I tried to do it myself, but it ended up adding some old commits unrelated to your changes.

Also, it would be great if you could explain a little why this is needed. It's fairly clear from the code, but for those less familiar with it a short explanation would be really helpful.

Could these changes fix #44?

Thanks again!

mainvolume · 2018-06-07T08:50:48Z

Sure thing!

mainvolume · 2018-06-07T08:55:25Z

Regarding #44:

It could be, since when streaming the model's sample rate has to match the device's for decoding to actually work 😄. I have not tested with a Bluetooth device, but my guess is that handling the device's audio settings becomes easier when the format is not fixed to a static frequency and instead adapts to the sample rate of the device's input bus.

This way, it also becomes possible to use the same decoder functionality on macOS.

Added buffer decoding

mainvolume · 2018-06-07T10:32:02Z

Hi Bruno.

I also added a decode-buffer function for buffers that have already been obtained and for other audio streams, along with start- and end-utterance convenience functions.

🙂

BrunoBerisso · 2018-06-07T11:10:29Z

TLSphinx/Decoder.swift


 import Foundation
 import AVFoundation
 import Sphinx

+public let bufferSize = 16384


Where does this value come from? Maybe add a comment about it?
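One way to reason about the constant: 16384 is 2^14, and if it is used as a frame count for the audio tap (an assumption; the thread does not confirm how it is consumed), the time it covers depends on the sample rate. The sketch below is illustrative only. The helper `bufferDuration` and both sample rates are hypothetical and are not part of TLSphinx.

```swift
import Foundation

/// Seconds of audio covered by `frameCount` frames at `sampleRate` Hz.
/// Hypothetical helper, not part of TLSphinx.
func bufferDuration(frameCount: Int, sampleRate: Double) -> TimeInterval {
    precondition(sampleRate > 0, "sample rate must be positive")
    return Double(frameCount) / sampleRate
}

// Same value as the `bufferSize` constant added in the diff above.
let bufferSize = 16384

// Sample rates picked for illustration; the PR does not name specific rates.
let atSixteenKilohertz = bufferDuration(frameCount: bufferSize, sampleRate: 16_000)
let atFortyFourKilohertz = bufferDuration(frameCount: bufferSize, sampleRate: 44_100)

print(atSixteenKilohertz)    // roughly one second of audio
print(atFortyFourKilohertz)  // a bit over a third of a second
```

A code comment along these lines ("2^14 frames, about one second at 16 kHz") might be enough to answer the question, if that reading of the constant is correct.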

BrunoBerisso · 2018-06-07T11:12:20Z

TLSphinx/Decoder.swift

        do {
-            try AVAudioSession.sharedInstance().setCategory(AVAudioSessionCategoryRecord)
+            try AVAudioSession.sharedInstance().setCategory(AVAudioSessionCategoryPlayAndRecord, with: [.mixWithOthers, .allowBluetoothA2DP])


Maybe we should add these as parameters to startDecodingSpeech? 🤔 Maybe there isn't a fixed set of settings for the audio session that works for everybody...

Updated to this:

public func startDecodingSpeech(_ audioSessionCategoryOptions: AVAudioSessionCategoryOptions = [.mixWithOthers, .allowBluetoothA2DP], utteranceComplete: @escaping (Hypothesis?) -> ()) throws {

    do {
        try AVAudioSession.sharedInstance().setCategory(AVAudioSessionCategoryPlayAndRecord, with: audioSessionCategoryOptions)
    } catch let error as NSError {
        print("Error setting the shared AVAudioSession: \(error)")
        throw DecodeErrors.CantSetAudioSession(error)
    }

BrunoBerisso · 2018-06-07T11:17:38Z

TLSphinx/Decoder.swift

@@ -248,7 +251,33 @@ public final class Decoder {
        engine.stop()
        engine = nil
    }
-
+
+    public func startUtterence() {


I think this and endUtterence shouldn't be public. My understanding is that they need to be public because you should call startUtterence() before startDecodingBuffer, right?

That is accurate. Do you mean we should make endUtterence private?

Updated audio session setting in decode speech function call

BrunoBerisso · 2018-06-07T11:22:12Z

TLSphinx/Decoder.swift

+		    self.start_utt()
+	  }
+
+	  public func startDecodingBuffer(buffer: AVAudioPCMBuffer!, time: AVAudioTime!, utteranceComplete: @escaping (Hypothesis?)-> ()) throws {


👏🏻👏🏻👏🏻 nice!
These will be really useful. How are you testing this?

Also, is something wrong with the tabs? Haha

The tabs... I'm editing in GitHub, since the codebase is at home and I'm at work right now. 😂

I haven't written any tests, but to bypass the microphone for the thinking machine implementation, a synthesized continuous buffer is passed to the function, which works quite nicely.

The function is based on the streaming function, but with the option of creating the buffer before passing it in, instead of using the tap inside the function.

Hold on, fixing the tabs.
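For readers wondering what a synthesized input could look like, here is a minimal sketch that generates a sine tone as 16-bit PCM samples. How the buffer was actually synthesized is not stated in this thread, so everything here is an assumption: `synthesizeSine`, the 440 Hz tone, and the 16 kHz rate are hypothetical. The samples would still need to be wrapped in an `AVAudioPCMBuffer` before being handed to startDecodingBuffer, which is not shown.

```swift
import Foundation

/// Generates `frameCount` samples of a sine tone as signed 16-bit PCM.
/// Hypothetical helper for illustration; not part of TLSphinx.
func synthesizeSine(frequency: Double,
                    sampleRate: Double,
                    frameCount: Int,
                    amplitude: Double = 0.5) -> [Int16] {
    precondition(sampleRate > 0 && frameCount >= 0, "invalid sample rate or frame count")
    precondition((0.0...1.0).contains(amplitude), "amplitude must be within 0...1")
    // Scale the unit sine into the Int16 range so the conversion cannot overflow.
    let scale = amplitude * Double(Int16.max)
    return (0..<frameCount).map { (n: Int) -> Int16 in
        let phase = 2.0 * Double.pi * frequency * Double(n) / sampleRate
        return Int16((sin(phase) * scale).rounded())
    }
}

// Tone, rate, and length are illustrative values only.
let samples = synthesizeSine(frequency: 440, sampleRate: 16_000, frameCount: 16384)
print(samples.count)
```

A pure tone will not produce a meaningful hypothesis, of course; it only exercises the buffer path without a microphone.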

Updated tabs

mainvolume · 2018-06-07T11:29:19Z

There, it should be tabbed more cleanly now.
🙂

Also, a sample project using TLSphinx (without buffer)
https://github.com/mainvolume/SpeechDetector

mainvolume · 2018-06-07T11:34:00Z

🤔
Regarding endUtterence...
The reason it's public is to be able to end the utterance when the buffer is completed, or similar.

If you wish, we can make it private, but then the utterance would still be running when the buffer ends, considering that start is called again after an utterance is read.

🙂
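The call order being discussed can be sketched with a small stand-in type. This is not the TLSphinx API: the protocol `UtteranceDecoding`, the `decode(_:)` method, and `decodeAll` are hypothetical, and the real buffer function takes an `AVAudioPCMBuffer` rather than an array. The sketch only shows why a caller that supplies its own buffers needs some way to close the utterance once the buffers run out.

```swift
import Foundation

/// Hypothetical stand-in for the start / decode / end calls discussed above.
protocol UtteranceDecoding {
    func startUtterence()
    func decode(_ samples: [Int16])
    func endUtterence()
}

/// Feeds every non-empty chunk to the decoder inside a single utterance.
func decodeAll(_ chunks: [[Int16]], with decoder: UtteranceDecoding) {
    decoder.startUtterence()
    // `defer` closes the utterance when the function returns, on any path.
    defer { decoder.endUtterence() }
    for chunk in chunks where !chunk.isEmpty {
        decoder.decode(chunk)
    }
}

/// Fake decoder that records the order of calls it receives.
final class RecordingDecoder: UtteranceDecoding {
    private(set) var calls: [String] = []
    func startUtterence() { calls.append("start") }
    func decode(_ samples: [Int16]) { calls.append("decode(\(samples.count))") }
    func endUtterence() { calls.append("end") }
}

let recorder = RecordingDecoder()
decodeAll([[1, 2, 3], [], [4, 5]], with: recorder)
print(recorder.calls)  // ["start", "decode(3)", "decode(2)", "end"]
```

If the start and end calls were made private, a wrapper of this shape inside the library would be one possible way to keep the pairing correct, at the cost of requiring all buffers up front.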

Bruno Berisso and others added 4 commits November 8, 2016 16:01

Add Slack button

3ce7254

Merge pull request tryolabs#46 from tryolabs/development

e2fafa5

1.0.0

Updated audio buffer functionality

30ae093

Added buffer size variable Minor fixes

Merge branch 'development'

db28784

BrunoBerisso changed the base branch from master to development June 7, 2018 08:44

BrunoBerisso changed the base branch from development to master June 7, 2018 08:44

mainvolume changed the base branch from master to development June 7, 2018 08:51

mainvolume added 2 commits June 7, 2018 12:28

Update Decoder.swift

5dccbfb

Added buffer decoding

Update Decoder.swift

d938fdc

BrunoBerisso reviewed Jun 7, 2018

Update Decoder.swift

4d36d58

BrunoBerisso reviewed Jun 7, 2018

Update Decoder.swift

7983dd5

Updated audio session setting in decode speech function call

BrunoBerisso reviewed Jun 7, 2018

Update Decoder.swift

b5e84ea

Updated tabs
