You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The first attribute cannot be removed. The second attribute (corresponding to the Mapped Term) can be removed.
Regex:
One attribute for each capturing group
All but one attribute can be removed
All attributes can be renamed
Example:
\$(\d+(\.\d+)?)\s+(billion) that captures text such as $3.4 billion should have 4 attributes by default:
Attribute 1 (covers the entire match and corresponds to group 0 of the regex, which is the entire match): $3.4 billion
Attribute 2 (covers group 1 of the regex) and matches 3.4
Attribute 3 (covers group 2 of the regex) and matches .4
Attribute 4 (covers group 3 of the regex) and matches billion
Sequence:
One attribute for each of the input nodes of the sequence
All but one attribute can be removed
All attributes can be renamed
Example 1: (<Metric.Metric>)<Token>{0,1}(<Preposition.Preposition>)<Token>{0,2}(<Division.Division>) that captures text such as revenue from the Global Technology Services should have 4 attributes by default (one for the entire match, and one for each open parenthesis):
Attribute 1 (covers the entire match and corresponds to group 0 of the sequence, which is the entire match): revenue from the Global Technology Services
Attribute 2 (covers group 1 of the sequence) and matches revenue
Attribute 3 (covers group 2 of the sequence) and matches from
Attribute 4 (covers group 3 of the sequence) and matches Global Technology Services
Example 2: (<Metric.Metric><Token>{0,1}<Preposition.Preposition>)<Token>{0,2}(<Division.Division>) that captures text such as revenue from the Global Technology Services should have 3 attributes by default (one for the entire match, and one for each open parenthesis - 2 of them):
Attribute 1 (covers the entire match and corresponds to group 0 of the sequence, which is the entire match): revenue from the Global Technology Services
Attribute 2 (covers group 1 of the sequence) and matches revenue from
Attribute 3 (covers group 3 of the sequence) and matches Global Technology Services
Union
Same attributes as those of the input nodes (the union cannot be created unless the input nodes have the same schema - the same number of attributes and the same names for the attributes)
All but one attribute can be removed
All attributes can be renamed
Consolidate
Same attributes as those of the input node
All but one attribute can be removed
All attributes can be renamed
Filter
Same attributes as those of the primary input node
All but one attribute can be removed
All attributes can be renamed
The text was updated successfully, but these errors were encountered:
The Consolidate node can only have a single input node. That's a constraint on the Consolidate node. I am not sure if the UI enforces this constraint now (if not yet, then it should).
The primary node is the input from which we include or exclude tuples. The one that shows at the top of the Filter dialog, right under the Exclude/Include dropbox, e.g., SentenceBoundary in the filter sample flow:
Currently attributes are allowed to be renamed and turned off on the Sequence node. We should enable this for all nodes.
Here is the desired behavior.
Dictionary (Mapped terms = OFF) and Literal:
Dictionary (Map terms = ON)
Mapped Term
by defaultRegex:
\$(\d+(\.\d+)?)\s+(billion)
that captures text such as$3.4 billion
should have 4 attributes by default:$3.4 billion
3.4
.4
billion
Sequence:
(<Metric.Metric>)<Token>{0,1}(<Preposition.Preposition>)<Token>{0,2}(<Division.Division>)
that captures text such asrevenue from the Global Technology Services
should have 4 attributes by default (one for the entire match, and one for each open parenthesis):revenue from the Global Technology Services
revenue
from
Global Technology Services
(<Metric.Metric><Token>{0,1}<Preposition.Preposition>)<Token>{0,2}(<Division.Division>)
that captures text such asrevenue from the Global Technology Services
should have 3 attributes by default (one for the entire match, and one for each open parenthesis - 2 of them):revenue from the Global Technology Services
revenue from
Global Technology Services
Union
Consolidate
Filter
The text was updated successfully, but these errors were encountered: