Hello,
I'm working with data extracted from PDFs and using regex to format it into columns. I'm struggling with missing elements and inconsistent spacing that throw off my column alignment. Any suggestions for a more robust regex approach to handle these variations?
Does anyone help in this
Hi, @dona168lee would using the data cleansing tool to remove duplicate whitespace be of help? Could you share some data that would better help me understand the problem?
Cheers
martinson
You can make your RegEx robust enough to understand spacing and when values should exist or not. But without data like @martinson indicated we'd just be guessing
User | Count |
---|---|
106 | |
82 | |
70 | |
54 | |
40 |