Register Members List Search Today's Posts Mark Forums Read

Reply
 
Thread Tools
  #1  
Old 14 Aug 2020, 18:51
Tim Wheatley's Avatar
Tim Wheatley Tim Wheatley is offline
 
Join Date: Nov 2001
Question Importing from old archive html files?

I ran a site 20 years ago that was lost and recently acquired the entire archive or everything that is on archive.org - it's a ton of posts, attachments, etc.

Is there any way at all to import from the archive (for example index.php?t-125231.html or whatever) into a real post in a database? Obviously expecting some data loss, non-recovery of users, etc. But is ANYTHING possible?

If not I will likely just upload the html archive and let search engines crawl it, maybe remove the links to the 'real' forum that no longer exists.
Reply With Quote
  #2  
Old 14 Aug 2020, 19:00
Dave Dave is offline
 
Join Date: Jun 2010
Real name: Dave
If you have the .html files in a relatively organized structure and format then it's possible by creating a custom PHP script that iterates over all the files.

This PHP script should utilize either the DOMDocument PHP class or regular expressions to get the content and to insert it properly into a database.
__________________
https://technidev.com - security, development, exploits, vBulletin
dave[at]technidev[dot]com

Contact me for custom vBulletin 3/4 work & server/website management.
Reply With Quote
  #3  
Old 15 Aug 2020, 03:09
Tim Wheatley's Avatar
Tim Wheatley Tim Wheatley is offline
 
Join Date: Nov 2001
Originally Posted by Dave View Post
If you have the .html files in a relatively organized structure and format then it's possible by creating a custom PHP script that iterates over all the files.

This PHP script should utilize either the DOMDocument PHP class or regular expressions to get the content and to insert it properly into a database.
Thanks very much for the reply. I'm really interested to hear it may be possible. The backups were downloaded from archivarix_com, so the formatting is quite nice I think. I'll see what I can do in terms of using find/replace and uploading it as an archive at this point...
Reply With Quote
Reply



Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off


New To Site? Need Help?

All times are GMT. The time now is 23:46.

Layout Options | Width: Wide Color: