Find and Delete duplicate source files from System
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi All,
Within a directory on my machine, there are over 5000 files containing 280 GB. I discovered almost 34 GB of duplicate files by using the Directory and Unique tools.
How can I use Alteryx to help me remove duplicate source files from my system? Please share any workflows you have that can be completed with any scripting language.
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@EN6924 Are you saying you have files that are named differently but have the same data in them on your computer? If they're automatically saved and versioned, like File.xlsx vs File(1).xlsx, then you can call those original files to move to a folder full of files you want to keep using the Run Command tool in a batch macro. You just need a list of the files you want to keep. If you don't have a list of files but they're named from automatic versioning, you can pull in that data using a Directory tool and send that to the batch macro.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi Prometheus
On my system, I have several files with the same name, but they are placed in different folders.
I configured my workflow in the similar way after finding a post on the Alteryx community. Approximately 17 GB of files were removed from my machine after running it. However, the remaining files remained in the system, and there are still 394 duplicate records and files.
I'm getting error messages -
Designer x64 Finished running New Workflow1.yxmd in 5.5 seconds with 11 field conversion errors
Sharing the workflow here. Need help to fix it properly.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@EN6924 The first error is saying that there's probably special characters that aren't recognized in String but are recognized in WString. Take a look at the Data Types help page. I use it all the time to try to understand the differences between specific data types that are generally the same, such as String, WString, and V_WString: https://help.alteryx.com/20231/designer/data-types#:~:text=Alteryx%20supports%20string%2C%20numeric%.... This thread talks about this error: https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/could-not-be-fully-converted-f.... The second error comes about because Alteryx is trying to convert a character recognized by the original data type (WString) and not recognized by the second data type (String). Any time you manually alter data types with a Select tool, you have to keep in mind the limitations caused by the differences between the data types. Alteryx will attempt to convert those characters, but if it can't because of the limitations of the data type you want, it'll mostly likely turn them to nulls. Take for instance the value "ABC." If you try to change the data type of the column that contains "ABC" to an integer, you'll get a null and the warning, after so many records, that the conversion error limit was reached.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi Prometheus
I built a workflow which copies the duplicate files and paste them to a new folder.
But I want to cut+paste the files, that is delete the duplicate files from source location and move them to the new folder.
Can you help me here?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@EN6924 Here's a script I've used to move files: /c move /y "Current Folder full path with file name and extension" "New Folder full path with file name and extension" . After that, you can run the script to delete the files from the old folder: /c del /y "Current Folder full path with file name and extension".
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@EN6924 Here's a workflow for you to modify. The batch file won't attach, but this is the script: /c move /y "C:\Users\YourName\YourPath\Fake Data.xlsx" "C:\Users\YourName\YourNEWPath\Fake Data.xlsx"
