Function to sanitize string for output #203

LorenzNickel · 2019-05-10T03:03:36Z

I was wondering if we could implement a function to sanitize a string by removing (or replacing) all characters which lead to an error if they are part of an audio response. For example as far as I remember a audio string must not contain any smileys, <, > (only for ssml purposes), & and probably a lot more chars.
All somehow dynamic skills have this problem, for example a skill which tells you the last tweet of a person always has to create some sort of 'clean' string without emojis and these symbols from the last tweet. This function would probably make it impossible to use valid ssml 'commands' like speech pause in it, but if you want to use this function, you can only sanitize the part of the string you get from the api (or which is dynamic for some other reason) and can still use ssml for everything else and even around the sanitized string (just not for only parts of the sanitized string).

I think providing such a function would be really helpful (in the other sdks too), but I also understand if you don't want to have it since it's not directly related. If wanted, I probably could even create a PR for this repo implementing it.
In case you want this feature, we could just remove the characters or try to replace them with an allowed equivalent (for example & gets 'and'), but then the function would also be language dependent and I'm not even sure if there would also be a problem with symbols in a certain context, for example something like ordinal numbers. Removing would definitely be way more easy.

Independently of this decision I think we should at least provide a list of characters somewhere which are 'forbidden' or are known to cause trouble. (I'd need it for a PR too, I already know some but we'd need a complete list)

jbrucej · 2019-05-27T12:39:32Z

Ditto. Would be helpful. Or perhaps just give us a list of those that are valid and/or invalid so we can write our own. Something like what's in the XML spec.

kkocel · 2022-03-07T21:30:19Z

I use this function to sanitize input org.apache.commons.text.StringEscapeUtils.escapeXml11 and it works very well.

rahulawl · 2022-09-12T10:50:38Z

Is this issue/feature-request still relevant?
We are working on prioritization of relevant issues and cleanup of rest. If we don’t hear back in 2 weeks, we will assume that the issue is not relevant and we will close it.

LorenzNickel · 2022-09-12T11:34:52Z

Yes, I think this issue is still relevant. I'm no longer actively writing Alexa Skills, but as long as you did not publish a specification and/or created such a sanitization function, it's still relevant.

Shreyas-vgr added the feature request label Jan 8, 2020

Shreyas-vgr added the enhancement label Apr 28, 2022

rahulawl added the triage label Sep 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Function to sanitize string for output #203

Function to sanitize string for output #203

LorenzNickel commented May 10, 2019

jbrucej commented May 27, 2019

kkocel commented Mar 7, 2022

rahulawl commented Sep 12, 2022

LorenzNickel commented Sep 12, 2022

Function to sanitize string for output #203

Function to sanitize string for output #203

Comments

LorenzNickel commented May 10, 2019

jbrucej commented May 27, 2019

kkocel commented Mar 7, 2022

rahulawl commented Sep 12, 2022

LorenzNickel commented Sep 12, 2022