Internet Marketing, Web Development and Programming Stuff

Posts Tagged ‘regex’

thumbnail

Chunk Data for Easier Scraping

Before you spend an hour writing some elaborate regular expression, try chunking the data and matching several expressions to make for a much simpler (and faster) scrape. So, assuming you’re using PHP, after you’ve pulled the data (e.g. with file_get_contents()), use preg_replace with the following regex to chunk the data into a much easier “soup” …