drew liverman, doing the thing.
In: dunno
3 Jul 2008Yesterday was the first time in about a year and a half of using MODx CMS/CMF that I had a need for to use the Import HTML function (in the Manager, via Tools -> Import HTML). I was not only impressed by how fast it performed with a large batch of HTML files, but ecstatic with the yield of the function’s end results.
The task at hand was like this: Port an older, hand-coded HTML site over to a dynamic content management system platform. Ideally without a ton of data entry (by an error-prone human anyway) involved in the process, since the project’s scope didn’t account for that. It did however account for all the original primary body and page title content (with its original markup) needed to remain intact for sake of natural SEO positioning the site’s content had accumulated over the years.
220 Pages of HTML Content.
Um, no. I was pretty much gonna have to enter the data myself or find some mysql database import method that I preferred not have to Google around for another “solution” (hehe).
Once I’d given the Import HTML feature a test run on a fresh dev install and gotten good results, I was 75% finished with the task in under an hour.
All that was left was to prep the HTML pages by batch removing unwanted code (like navigation and header/footer/callout/etc).
This went like this (on a Mac, Windows users could easily follow suit with some Windows-ey yucky stuff).
I make stuff on the inter-tubes. I'm somehow or another a strange hybrid creature of half-nerd, half artsy-fartsy designer with an unruly love for typography. I heart girls, and have 4 wonderful ones. Boosh!
1 Response to Importing (a LOT of) HTML Documents with MODx CMS/CMF
amik
June 10th, 2010 at 8:23 am
I love modx. Good idea to use coda to remove the code you dont need!