Hello,
I have just installed drupal 4.7 on a Linux server and created several sections/subsections, taxonomy and so on.
Now, I have a *lot* of already existing articles for this website on my hard disk, in HTML format, one file per article. The question is how can I insert all of them in Drupal with a shell/mysql/python/perl/whatever script?
Basically I want to make one single tarfile, of all HTML pages and the script, upload it on the server, launch the script via ssh
and have all articles inserted in the right section/subsection, with the correct creation date (that of the original HTML file, not the date when the script was run), title and teaser, taxonomy tags etc....
One thing which is not mandatory, but of course would be a huge time saver would be smart handling of internal links. Several of these HTML files refer to each other, like "to know more, read this page (= file:///main/some_section/some_subsection/specific_article)". Could the script also make so that all the resulting Drupal nodes
would also link correctly to each other?
I have no problem to modify an existing script or just code myself any missing part, but of course I'd really like to start with
a working example, or at least to know what exactly I should study to do this job.
Also scripts, explanations, tricks and comments on how to pre-process the HTML files before inserting them, so they are surely Drupal/CSS compatible would be great. Consider that many of these HTML files were just generated inside OpenOffice, converting existing openoffice files with paragraph styles with the "Save as" or "Export" functions.