Hi all,
I am trying to remove duplicates in my in-DB data by grouping all the fields with a Summarize In-DB tool. This works if I stream the data out and stream it back in before the Summarize In-DB tool but not if I don't. As in if the pair of tools in the image below are used.

Also, if I stream all the data out and use Alteryx's Unique tool then this also successfully removes duplicates. I was hoping someone might know why this is happening and how I can get the Summarize In-DB tool to remove duplicates without having to stream data out at any point.
My theory is that streaming out changes the data in some way which cleanses it but I haven't been able to confirm this is the case or understand any logical reason for why that would happen.
Any help, much appreciated.
Thanks