<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to filter rows that start with ### and then parse between ### and Text in Alteryx Designer Desktop Discussions</title>
    <link>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/How-to-filter-rows-that-start-with-and-then-parse-between-and/m-p/243038#M51276</link>
    <description>&lt;P&gt;Okay, I have a nightmare (for my novice eyes) data source. &amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The source document is a PDF. &amp;nbsp;The PDF does not export nicely to XML or CSV and it will not import into Excel in a format that I can use. &amp;nbsp;I was able to get it into Alteryx as a CSV input. &amp;nbsp; But, everything is trapped in one field. &amp;nbsp;There are a few exceptions, but first things first.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Fortunately, the source document is outlined. &amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The vast majority of what I need to extract is prefaced with a numbered outline reference. &amp;nbsp;The format appears to be ###-#.#.#....&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. &amp;nbsp;How can I filter out those rows that do not start ###? &amp;nbsp;Let me clarify, it must start with a 3-digit number ###.&lt;/P&gt;&lt;P&gt;2. &amp;nbsp;There is always an [Orig] between the outline ### and the text. &amp;nbsp;How can I get the parse tool to use the [Orig] to output everything before it to column 1 and everything after it to column 2?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Field 1&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;SECTION 0: [Orig] General Spec Admin&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;042 [Orig] General Spec Requirements&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;042-1 [Orig] General&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;042-1.2 [Orig] Items blah blah blah&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;042-1.3 [Orig] The Contractor blah blah blah&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Table 042-1 [Orig] The objective items blah&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 13 Jun 2024 22:43:18 GMT</pubDate>
    <dc:creator>hellyars</dc:creator>
    <dc:date>2024-06-13T22:43:18Z</dc:date>
    <item>
      <title>How to filter rows that start with ### and then parse between ### and Text</title>
      <link>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/How-to-filter-rows-that-start-with-and-then-parse-between-and/m-p/243038#M51276</link>
      <description>&lt;P&gt;Okay, I have a nightmare (for my novice eyes) data source. &amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The source document is a PDF. &amp;nbsp;The PDF does not export nicely to XML or CSV and it will not import into Excel in a format that I can use. &amp;nbsp;I was able to get it into Alteryx as a CSV input. &amp;nbsp; But, everything is trapped in one field. &amp;nbsp;There are a few exceptions, but first things first.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Fortunately, the source document is outlined. &amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The vast majority of what I need to extract is prefaced with a numbered outline reference. &amp;nbsp;The format appears to be ###-#.#.#....&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. &amp;nbsp;How can I filter out those rows that do not start ###? &amp;nbsp;Let me clarify, it must start with a 3-digit number ###.&lt;/P&gt;&lt;P&gt;2. &amp;nbsp;There is always an [Orig] between the outline ### and the text. &amp;nbsp;How can I get the parse tool to use the [Orig] to output everything before it to column 1 and everything after it to column 2?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Field 1&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;SECTION 0: [Orig] General Spec Admin&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;042 [Orig] General Spec Requirements&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;042-1 [Orig] General&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;042-1.2 [Orig] Items blah blah blah&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;042-1.3 [Orig] The Contractor blah blah blah&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Table 042-1 [Orig] The objective items blah&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 13 Jun 2024 22:43:18 GMT</pubDate>
      <guid>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/How-to-filter-rows-that-start-with-and-then-parse-between-and/m-p/243038#M51276</guid>
      <dc:creator>hellyars</dc:creator>
      <dc:date>2024-06-13T22:43:18Z</dc:date>
    </item>
    <item>
      <title>Re: How to filter rows that start with ### and then parse between ### and Text</title>
      <link>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/How-to-filter-rows-that-start-with-and-then-parse-between-and/m-p/243121#M51280</link>
      <description>&lt;P&gt;This is one for regex.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Firstly I use a regex_match to filter to only those statements starting with 3 characters and then use the regex option in parse mode to pull out the before and after [Orig].&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;(\d.+) \[Orig\] (.+)&lt;/PRE&gt;&lt;P&gt;in theory it could be&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;(.+) \[Orig\] (.+)&lt;/PRE&gt;&lt;P&gt;Workbook attached.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ben&lt;/P&gt;</description>
      <pubDate>Thu, 23 Aug 2018 23:16:42 GMT</pubDate>
      <guid>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/How-to-filter-rows-that-start-with-and-then-parse-between-and/m-p/243121#M51280</guid>
      <dc:creator>BenMoss</dc:creator>
      <dc:date>2018-08-23T23:16:42Z</dc:date>
    </item>
    <item>
      <title>Re: How to filter rows that start with ### and then parse between ### and Text</title>
      <link>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/How-to-filter-rows-that-start-with-and-then-parse-between-and/m-p/243137#M51284</link>
      <description>&lt;P&gt;Thank you. I got it to work using the&amp;nbsp;less direct regex Filter formula&amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;regex_match([Field 1],"^\d.*"). &amp;nbsp;&lt;/EM&gt;&lt;/STRONG&gt;&amp;nbsp;But, this (especially the regex parse tool) helps get their quicker. &amp;nbsp;Thanks.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 24 Aug 2018 00:20:04 GMT</pubDate>
      <guid>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/How-to-filter-rows-that-start-with-and-then-parse-between-and/m-p/243137#M51284</guid>
      <dc:creator>hellyars</dc:creator>
      <dc:date>2018-08-24T00:20:04Z</dc:date>
    </item>
  </channel>
</rss>

