The Vault Preservation Project
#26
Posted 29 September 2012 - 02:11
#27
Posted 29 September 2012 - 02:04
#28
Posted 29 September 2012 - 02:22
WooHOO! You're my favorite evilly grinning cat! =)
I know you have bandwidth issues Tarot. I think you're amazing doing anything at all! Maybe Brits do remember how to rock! ;-)
Tarot Redhand wrote...
Rolo I would love to help but I am on pre-paid inet at the moment. Having looked through your list, I had a thought. As there is a lot of nwn2 stuff have you posted a link in the nwn2 custom content section of these boards. I'm sure that there must be people in there who would love to come on-board. Actually now I think about it, it wouldn't hurt to post a link to this thread in the modules, toolset and scripting forums of the nwn section either.
Tchos posted a link on the NwN2 forums. Cross-posting here isn't a bad idea, but my time is *really* limited today ;-/ Anyone want to do it?
Basically, all the info will have to be cut and pasted when I get the VPP content type built (sometime tomorrow). So whatever method is easiest for you to work with. You can also save it as a PDF if that works for you. The idea is to keep the structure of the Vault entries for compatibility as a mirror going forward, while migrating the content to a viable CMS (Content Management System).
Couple of practicalities spring to mind.
- Do you want the pages saving as a web archive (mht) or just html with separate pictures etc.?
- What do you want to do with multi-page comments?
<...like a fool with an invisible cat from cheshire>
#29
Posted 29 September 2012 - 02:25
WooHOO! My favorite... er, new friend =)
Whatever would be easiest for you to cut and paste into the form when I get that ready (sometime tomorrow - see response to Tarot Redhand).
Eric of Atrophy wrote...
A thought - would .pdf captures of the pages be workable? In addition to getting the files, of course...
<...like a withered old man>
#30
Posted 29 September 2012 - 02:29
(see my response to Virusman) Basically, I couldn't get it working efficiently. After several hours of spidering down every link (including ads) I had very few pages of actual projects. If someone has a lot better luck (or skill), that would be a great option.
kamal_ wrote...
Why not use a web scraper?
On the other hand, I'm getting damn good vibes from the people rallying 'round. It really demonstrates the great beating heart of this community =)
Can't beat that with a stick!
<...frustrated with wget>
#31
Posted 29 September 2012 - 02:45
Edited by Pstemarie, 29 September 2012 - 02:45.
#32
Posted 29 September 2012 - 04:16
<ducking all the flying cow chips I see coming my way>
I know, I know, "editing" is a pain, and sometimes considered negative, BUTT (where is that smiley with the shaking derriere anyway?), BUTT, removing extraneous stuff for something important like this could really prove invaluable. Not sure how to handle that without further discussion, as a community needs to feel like they "belong." BUTT the extraneous posts make it sometimes very frustrating for someone just looking for an "answer" instead of a discussion about how pretty the MOON is tonight.
Also, beyond removing extraneous postings, editing for spelling and clarity might also prove useful. Most especially for important aspects of the thread. Things like spelling issues where someone accidentally typed .DDA when we ALL know they meant .DDS instead, except of course the newbie that is left desperately searching for that danged .DDA file that is causing all the problems and can't find any anywhere.
<lift my head to see if the incoming is done>
<<smack>>
<spluttering, dang it, knew I should have kept my mouth shut!>
If this "group" of folks copying this stuff and formalizing it into a final format wishes, I will offer my services as an editor prior to final posting wherever you all decide to post the online version. Provided of course, that whatever format/document style you decide upon is something that I can fairly easily figure out and use. (I hate .wiki as .wiki may be powerful, but takes a HUGE amount of extra code typing to make it work.) BBCODE comes to mind as a fairly simple, yet still very powerful way to offer formatting, assuming (yes, I know, A S S out of me) of course that the final product is intended as a web-available resource, not a .PDF which requires a special reader and is a bit of a pain to create as it typically also requires a paid-for application. (Remember, I am in hospital, waiting for Medicaid to decide whether or not I get to keep my leg or they will just cut it off since I don't have insurance, a job, nor any future hope of any such).
Anyway, I have time, and some limited skill, and I can read/write with the best 4th grader, so I will help if I can.
<after finally getting my mouth clear, I peek over the top again, and notice something that looks like an automated firing mechanism aimed directly at me but just can't seem to duck down fast enough as it fires again>
Edit: Sorry Rolo, attempting to attempt to follow your weird double or is it triple talking style with very limited success.
#33
Posted 29 September 2012 - 04:33
1: Grab the creature hak + author's information + a screenshot
2: Create an entry on our new cc page with a title like "berzerk ogre by <author's name>"
3: Upload the hak/rar or zip and just paste the original author's information + a screenshot into the entry page.
So every piece of content has one page of its own, with a title like the one on the Vault.
That's it, basically!
p.s.
and a link to the original vault entry of course!!!
pp.s.
and maybe we state once, somewhere on the main page, that we do this without the authors' explicit permission, and only to keep the content available in case something happens to the Vault entry in the future (which we hope it won't).
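The steps above amount to one record per piece of content: a title, the author's information, a screenshot, the files, and a link back to the Vault. A minimal sketch of such a record in Python follows. The class name, the field names, and the sample values are illustrative assumptions, not anything the thread or the VPP site defines.

```python
from dataclasses import dataclass, field


@dataclass
class VaultEntry:
    """One preserved entry. All field names here are illustrative assumptions."""
    title: str
    author: str
    description: str        # the original author's information, pasted as-is
    vault_url: str          # link back to the original Vault entry
    screenshot: str = ""    # path to a saved screenshot, if any
    files: list = field(default_factory=list)  # hak / rar / zip file names

    def page_title(self) -> str:
        # Follows the "<content name> by <author's name>" pattern from step 2.
        return f"{self.title} by {self.author}"


entry = VaultEntry(
    title="berzerk ogre",
    author="example author",              # placeholder, not a real credit
    description="Original description text goes here.",
    vault_url="http://nwvault.ign.com/",  # placeholder; use the entry's own URL
    files=["berzerk_ogre.hak"],           # hypothetical file name
)
print(entry.page_title())
```

Keeping `vault_url` as a required field reflects the p.s.: every mirrored page should point back to its original entry.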
Edited by NWN_baba yaga, 29 September 2012 - 04:42.
#34
Posted 29 September 2012 - 04:41
*shakes head sadly at BB's pitiful attempt* My dear... er, beast. It's really quite simple. <for him>
I say things and the bird... <that's lady stormshadow, wizard. since you're being all patronizing and everything>
...whom I call Bother for very good reason... <keep it up, old man. *i* know how to operate the chip-chucker>
...comments. Occasionally, Cestus Dei... <stumpy & grumpy to his... friends>
...also er, chips in. :-) <graceful, boss. smooth, even>
So you see, extraneous text can add flavor, ambiance, humor and interest. :-) <eye-strain>
That, too.
In regards to the VPP, the framework I'm using (Drupal) has great search functions and much more editing control for authors over their own posts. That said, I understand what you are talking about.
I have made several of the volunteers "Moderators" (I have no problem making moderators - You're one =) so they *can* do editing if it really warrants it - as in the case of obvious misspellings, etc.
OTOH, any editing done is at the discretion of the author and, if necessary, editing can be reversed (all revisions are stored and can be rolled back at need).
On the third hand, you produce great tutorials and are a constant source of reference. If you volunteer to er, "massage" posts and authors agree, the community will benefit. And you *do* have some time... though the reason for that still angers me :-/ <easy, boss>
Heh. Right.
Short: You *have* the *power*, he-beast! Now you just have to exercise it responsibly, eh?
<...of extraneous stuff for flavor>
#35
Posted 29 September 2012 - 04:50
The form entry will mirror the Vault's current form. The same info will need to be entered (including Author). The idea is to mirror, to *preserve* the Vault, so it will be very much the same.
NWN_baba yaga wrote...
Well i had something very easy in mind that is like this process:
1: Grab the creature hak + author's information + a screenshot
2: Create an entry on our new cc page with a title like "berzerk ogre by <author's name>"
3: Upload the hak/rar or zip and just paste the original author's information + a screenshot into the entry page.
So every piece of content has one page of its own, with a title like the one on the Vault.
I should have pointed out earlier that I *am* an admin on the Vault (until they give me the boot :-P ) and I have no intention of abusing that trust. This entire project is not to take anything away from the Vault, but to safeguard it. I consider all of the content we are preserving to retain the rights granted and reserved by both the original author and the Vault. That will be posted prominently in the VPP.
p.s.
and a link to the original vault entry of course!!!
pp.s.
and maybe we state once, somewhere on the main page, that we do this without the authors' explicit permission, and only to keep the content available in case something happens to the Vault entry in the future (which we hope it won't).
<...that makes little sense>
#36
Posted 29 September 2012 - 05:49
Edited by OldTimeRadio, 29 September 2012 - 05:51.
#37
Posted 29 September 2012 - 05:58
MDA
#38
Posted 29 September 2012 - 06:55
Rolo Kipp wrote...
<snip>
I have made several of the volunteers "Moderators" (I have no problem making moderators - You're one =) so they *can* do editing if it really warrants it - as in the case of obvious misspellings, etc.
Interesting, since I can't seem to remember ever having an account there?
#39
Posted 29 September 2012 - 07:22
That said, doing it all by hand is the path to madness. This kind of thing is what computers
are for (and NWN of course). All of us have more interesting things we could be doing than
tediously and error-proneously copying dynamic web content from one form to another.
If we could get the database and access to the content storage it would be a matter of teaching
the new web interface to query and display it. If that's not possible, a tool could be written to
scrape the pages and dump them to a storage directory with an XML or simpler key=value metadata file.
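The "simpler key=value metadata file" could look something like the sketch below, which writes one pair per line and reads it back. This is only an illustration of the idea: the function names, the file name, and the choice to flatten newlines inside values are assumptions, not a format anyone in the thread agreed on.

```python
import os
import tempfile


def write_metadata(path, fields):
    """Write one key=value pair per line; whitespace runs in values are flattened."""
    with open(path, "w", encoding="utf-8") as fh:
        for key, value in fields.items():
            flat = " ".join(str(value).split())
            fh.write(f"{key}={flat}\n")


def read_metadata(path):
    """Read the file back into a dict, splitting each line on the first '=' only."""
    result = {}
    with open(path, encoding="utf-8") as fh:
        for line in fh:
            line = line.rstrip("\n")
            if "=" not in line:
                continue  # skip blank or malformed lines
            key, value = line.split("=", 1)
            result[key] = value
    return result


# Round trip through a temporary directory with made-up sample values.
demo_path = os.path.join(tempfile.mkdtemp(), "entry.meta")
write_metadata(demo_path, {"title": "berzerk ogre", "description": "Line one\nline two"})
loaded = read_metadata(demo_path)
print(loaded)
```

Splitting on the first `=` only means values may themselves contain `=`, which matters for URLs with query strings.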
95% of the links and such on those pages is noise. I'm not surprised wget mirror choked. The interesting bits are more or less between the "<!-- Output Meta Data Rows -->" and the "<!-------Start Network Connections Box -->" comments. That significantly reduces the parsing needed and the number of links to deal with.
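Cutting a page down to the part between those two comments needs no HTML parser at all. A minimal sketch in Python, using the marker strings exactly as quoted above; real pages may differ in whitespace or capitalization, and the function name and sample HTML are made up for illustration.

```python
START_MARKER = "<!-- Output Meta Data Rows -->"
END_MARKER = "<!-------Start Network Connections Box -->"


def extract_entry_block(html):
    """Return the text between the two markers, or None if either is missing."""
    start = html.find(START_MARKER)
    if start == -1:
        return None
    start += len(START_MARKER)
    end = html.find(END_MARKER, start)
    if end == -1:
        return None
    return html[start:end]


# Made-up page: everything outside the markers stands in for ads and navigation.
sample = (
    "<html>ads and navigation"
    "<!-- Output Meta Data Rows -->"
    "<tr><td>Title</td><td>Example</td></tr>"
    "<!-------Start Network Connections Box -->"
    "more ads</html>"
)
block = extract_entry_block(sample)
print(block)
```

Returning `None` when a marker is missing makes it easy to log pages that do not follow the expected layout instead of silently saving junk.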
The forum downloader OTR mentioned might do most of that already. It may be a matter of going
back and getting the actual downloads. That may be the place to start...
If I have time I can try to come up with something. I'm more of a systems person than a web
programmer but perl is always fun...
Cheers,
Meaglyn
#40
Posted 29 September 2012 - 09:53
TR
#41
Posted 30 September 2012 - 05:49
started looking at the "scripts" section, and noticed they followed the pattern:
http://nwvault.ign.c....Detail&id=3795
where 3795 is the entry number. changing that (up or down) will yield a new entry.
so it should be a simple case to just enumerate from 1 ... n, curl-ing each page into a file.
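The enumeration step can be sketched as below, in Python rather than the Perl used further down. The URL pattern is the one that appears in the script in this post; the output file naming and the id range are assumptions for illustration. Like the Perl script, the sketch only prints the commands it would run.

```python
BASE_URL = "http://nwvault.ign.com/View.php?view=Scripts.Detail&id={entry_id}"


def entry_urls(first_id, last_id):
    """Yield (entry_id, url) for every id in the inclusive range."""
    for entry_id in range(first_id, last_id + 1):
        yield entry_id, BASE_URL.format(entry_id=entry_id)


def output_name(entry_id):
    # Zero-padded so files sort in id order; this naming scheme is an assumption.
    return f"scripts_{entry_id:05d}.html"


# Dry run: print one curl command per entry instead of fetching anything.
for entry_id, url in entry_urls(3794, 3796):
    print(f"curl -s '{url}' -o {output_name(entry_id)}")
```

The URL is wrapped in single quotes in the printed command because an unquoted `&` would be interpreted by the shell. Ids with no entry behind them will still return some page, so the saved files would need a check for missing entries.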
i started work on a perl script to then scan the curl'd file until it got to the "Files" section, and i parsed out the download links. i've not tested it much, but it looks like it's gonna work.
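#!/usr/bin/perl
use strict;
use warnings;

# Fetch one entry page. The URL is single-quoted so the shell leaves '&' alone.
#my $page = `wget -q -O - 'http://nwvault.ign.com/View.php?view=Scripts.Detail&id=3794'`;
my $page = `curl -s 'http://nwvault.ign.com/View.php?view=Scripts.Detail&id=3794'`;
my @lines = split /\n/, $page;
my $files = 0;    # true while we are inside the "Files" section
foreach my $l (@lines) {
    # The "Files" anchor opens the section that holds the download links.
    if ($l =~ /<a name="Files"><\/a>Files/) {
        $files = 1;
    }
    # if ($l =~ /Post New Comments/) {
    # The next closing table tag ends the section.
    if ($l =~ /<\/TABLE>/) {
        $files = 0;
    }
    next if !$files;
    if ($l =~ /<a href="(fms\/Download\.php.*?)"/) {
        # print "$1\n";
        # `wget 'http://nwvault.ign.com/$1'`;
        # Dry run: print the command instead of running it.
        print "wget 'http://nwvault.ign.com/$1'\n";
    }
}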
it's VERY unfinished, but it's too late for me to work on it anymore tonight.
it should be a simple exercise of saving the files to whatever directory structure/format rolo set up in his post.
i'll give it another go tomorrow and try to get a "final version" of the script done, unless somebody takes the effort over.
#42
Posted 30 September 2012 - 06:30
#43
Posted 30 September 2012 - 07:05
#44
Posted 30 September 2012 - 02:50
Rolo's objective was to get more than just the files themselves.
#45
Posted 30 September 2012 - 03:49
TR
#46
Posted 30 September 2012 - 04:41
Absolutely. In addition, there are many projects that are quite dead (my dear old friend Ratbert #CP# being a good, and rather sorrowful example). In this case, what is the value of the project being included at all, except for historical purposes (it does still link to the Mad Lemur's seldom used blog... *sigh* )? I *do* still wish to include them, but... Perhaps I can flag them for research to try to recover the content at leisure. Flags are easy ;-)
AndarianTD wrote...
A quick observation on this effort: you may need to be careful to distinguish between content that's hosted on the Vault, and content that's linked to on the Vault but actually hosted on another site.
...
Some of us host our work on our own sites and only link to it from the Vault, and wouldn't want it to be re-hosted anywhere else.
There is also the case where several projects have links to other projects (like the haks required by a series of mods). In that case also the *links* should be preserved and *not* followed.
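One way to honor both points is to sort every link on an entry page before anything is downloaded. The sketch below is a guess at such a rule, not an agreed policy: it mirrors only links that point at the Vault's own download script (the `fms/Download.php` path seen in the Perl script earlier in the thread) and preserves everything else as a plain link. The function name and the sample URLs are hypothetical.

```python
from urllib.parse import urlparse

VAULT_HOST = "nwvault.ign.com"
DOWNLOAD_PATH = "/fms/Download.php"   # path taken from the Perl script in this thread


def classify_link(url):
    """Return 'mirror' for Vault-hosted downloads, 'preserve' for every other link."""
    parts = urlparse(url)
    host = (parts.hostname or "").lower()
    if host == VAULT_HOST and parts.path.startswith(DOWNLOAD_PATH):
        return "mirror"
    return "preserve"


links = [
    "http://nwvault.ign.com/fms/Download.php?id=1",                  # hypothetical download
    "http://nwvault.ign.com/View.php?view=Scripts.Detail&id=3795",   # another Vault entry
    "http://example.com/my-own-module.zip",                          # author-hosted file
]
plan = {link: classify_link(link) for link in links}
for link, action in plan.items():
    print(action, link)
```

Links to other Vault entries fall under `preserve`, so a series of mods that points at its required haks keeps the links without the scraper wandering off along them.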
<...the branches that lead to infinity>
#47
Posted 30 September 2012 - 05:27
#48
Posted 30 September 2012 - 09:54
On the topic of extracting the information: using Nvu, it would appear that the information required can be split into 2 sections. The first is a table that contains 2 sub-tables. The first of these sub-tables contains the details of the submission and the second sub-table contains the submission itself. The second is a table containing 0 to many sub-tables which host the comments.
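That layout can be checked mechanically by counting the sub-tables directly inside each top-level table. A minimal sketch with Python's standard `html.parser` follows. The sample HTML is invented to match the description above, and real Vault pages very likely wrap these tables in further layout tables, so treat the depth numbers as assumptions.

```python
from html.parser import HTMLParser


class TableOutline(HTMLParser):
    """Count the direct sub-tables of each top-level table."""

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.subtable_counts = []   # one entry per top-level table

    def handle_starttag(self, tag, attrs):
        if tag != "table":
            return
        self.depth += 1
        if self.depth == 1:
            self.subtable_counts.append(0)      # a new top-level table
        elif self.depth == 2:
            self.subtable_counts[-1] += 1       # a direct sub-table

    def handle_endtag(self, tag):
        if tag == "table" and self.depth > 0:
            self.depth -= 1


# Invented sample: a details/submission table, then a comments table with one comment.
sample = (
    "<table><tr><td>"
    "<table><tr><td>details</td></tr></table>"
    "<table><tr><td>submission</td></tr></table>"
    "</td></tr></table>"
    "<table><tr><td>"
    "<table><tr><td>comment 1</td></tr></table>"
    "</td></tr></table>"
)
outline = TableOutline()
outline.feed(sample)
print(outline.subtable_counts)  # prints [2, 1]
```

`HTMLParser` lowercases tag names, so pages that write `<TABLE>` in capitals are handled the same way.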
TR
Edited by Tarot Redhand, 30 September 2012 - 09:55.
#49
Posted 30 September 2012 - 09:59
How about this then, put in the required fields (title, author and a couple others) and upload the captured page as an archive with the file. That is one file is a .7z of the snapshotted page and the other files are the downloads...? That way the stuff is there, just not convenient... as you said.
Edit: Did two samples of the textures. Ugly but working.
<...waving from the hole he's dug himself into>
Edited by Rolo Kipp, 01 October 2012 - 12:15.
#50
Posted 30 September 2012 - 10:16
Ok, so far I've built (bare bones, no tweaks to display or anything cool :-P ):
- NwN Character
- NwN Screenshots
- NwN Creatures
- NwN Models
- NwN Other
- NwN Textures
Note: Keeping the Vault's hierarchy, the add content menu is something like:
Add Content -> VPP -> NwN -> Community (nothing there) -> Files -> Characters -> Creatures -> Models
Etc. Hover over the menu to the left and explore :-P
<...for air...and java>
Edited by Rolo Kipp, 30 September 2012 - 10:21.




