<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic workeRe: Pulling Billions of Data to Store to then Use in Alteryx Designer Desktop Discussions</title>
    <link>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/Pulling-Billions-of-Data-to-Store-to-then-Use/m-p/1379211#M341880</link>
    <description>&lt;P&gt;This worked beautifully, thank you!&lt;/P&gt;</description>
    <pubDate>Thu, 20 Mar 2025 12:29:55 GMT</pubDate>
    <dc:creator>nafong</dc:creator>
    <dc:date>2025-03-20T12:29:55Z</dc:date>
    <item>
      <title>Pulling Billions of Data to Store to then Use</title>
      <link>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/Pulling-Billions-of-Data-to-Store-to-then-Use/m-p/1378815#M341772</link>
      <description>&lt;P&gt;I wanted some help or suggestions on how I should approach this issue I am faced:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Currently, I am using my workflow to connect to a database (Impala SQL) by using the Dynamic Input tool. I have around 15 distinct queries that pull different information per person depending on the Group ID I input into the Text Box Interface tool (replaces a "dummy group id" from the query with the name I input into the text box).&lt;/LI&gt;&lt;LI&gt;It takes about 50 minutes to fully run them, as other people are running queries from the same data-lake, which is restricted by how many queries can be run at once because of a memory limit.&amp;nbsp;&lt;/LI&gt;&lt;LI&gt;Essentially, the queries pull billions of rows of data, searching to find a snippet of data that matches certain conditions, and then throw away the billions of rows away.. multiplied by how many people are running similar queries at once&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is there a way that I can pull the billions of rows once, and then look for those snippets of data matching certain conditions without "throwing it away" to save on run-time but also not be restricted by memory issues?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ideas:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Pulling once into a csv file and then using that file to reference&lt;UL&gt;&lt;LI&gt;May not be feasible because they are limited to about 2 million rows&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;Using the Cache Dataset Macro -- is this the right way to approach this?&amp;nbsp;&lt;UL&gt;&lt;LI&gt;How can I use this tool so that people without an Alteryx license can use the workflow through the Alteryx server?&amp;nbsp;&lt;/LI&gt;&lt;LI&gt;How can I set it up in a way that the large dataset is only run once, and when another user uses the Application portion of it (input a Group ID in a textbox), it starts running from what was cached?&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I appreciate any ideas or help provided. Thank you!&lt;/P&gt;</description>
      <pubDate>Tue, 18 Mar 2025 18:24:34 GMT</pubDate>
      <guid>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/Pulling-Billions-of-Data-to-Store-to-then-Use/m-p/1378815#M341772</guid>
      <dc:creator>nafong</dc:creator>
      <dc:date>2025-03-18T18:24:34Z</dc:date>
    </item>
    <item>
      <title>Re: Pulling Billions of Data to Store to then Use</title>
      <link>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/Pulling-Billions-of-Data-to-Store-to-then-Use/m-p/1378847#M341775</link>
      <description>&lt;P&gt;You could try running it and storing the data as a yxdb file.&amp;nbsp; Then have a separate workflow that references the yxdb file.&amp;nbsp; &amp;nbsp;I've also broken up data and stored them in separate files based on year (for example).&amp;nbsp; Then, brought those individual files in together to continue the research.&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 18 Mar 2025 19:46:12 GMT</pubDate>
      <guid>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/Pulling-Billions-of-Data-to-Store-to-then-Use/m-p/1378847#M341775</guid>
      <dc:creator>lwolfie</dc:creator>
      <dc:date>2025-03-18T19:46:12Z</dc:date>
    </item>
    <item>
      <title>workeRe: Pulling Billions of Data to Store to then Use</title>
      <link>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/Pulling-Billions-of-Data-to-Store-to-then-Use/m-p/1379211#M341880</link>
      <description>&lt;P&gt;This worked beautifully, thank you!&lt;/P&gt;</description>
      <pubDate>Thu, 20 Mar 2025 12:29:55 GMT</pubDate>
      <guid>https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/Pulling-Billions-of-Data-to-Store-to-then-Use/m-p/1379211#M341880</guid>
      <dc:creator>nafong</dc:creator>
      <dc:date>2025-03-20T12:29:55Z</dc:date>
    </item>
  </channel>
</rss>

