Tuesday, 2 May 2017

From Sed function to JavaScript Regex

I have this text normalization function in bash

normalize_text() {
  tr '[:upper:]' '[:lower:]' | sed -e 's/^/__label__/g' | \
    sed -e "s/'/ ' /g" -e 's/"//g' -e 's/\./ \. /g' -e 's/<br \/>/ /g' \
        -e 's/,/ , /g' -e 's/(/ ( /g' -e 's/)/ ) /g' -e 's/\!/ \! /g' \
        -e 's/\?/ \? /g' -e 's/\;/ /g' -e 's/\:/ /g' | tr -s " "
}

I need to transform it into a JavaScript RegExp in order to parse text within a node.js application.



via loretoparisi

No comments:

Post a Comment