The links to the text files aren't working correctly.
This is one of the links from the text files page:
http://www.totse.info/en/en/religion/christianity/sexingodswords192134.html
However the correct link to the text file is:
http://www.totse.info/en/religion/christianity/sexingodswords192134.html
It looks like the text files/links are being added manually?
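If the doubled "/en/en/" comes from a prefix being prepended twice when the links are generated, something like the following could clean up the broken links in bulk. This is only a rough sketch: `collapse_repeated_prefix` is a name I made up, and it assumes the only defect is a first path segment that is immediately repeated. I don't know how the site actually builds its links.

```python
from urllib.parse import urlsplit, urlunsplit


def collapse_repeated_prefix(url: str) -> str:
    """Drop the second path segment when it merely repeats the first one."""
    parts = urlsplit(url)
    segments = parts.path.split("/")
    # segments[0] is "" because the path starts with "/".
    if len(segments) > 2 and segments[1] and segments[1] == segments[2]:
        del segments[2]
    return urlunsplit(parts._replace(path="/".join(segments)))


broken = "http://www.totse.info/en/en/religion/christianity/sexingodswords192134.html"
print(collapse_repeated_prefix(broken))
```

A link that is already correct passes through unchanged, so it should be safe to run over every link on a page.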
Comments
I am not sure about all the HTML stuff, but I am pretty sure CSS can be used to make all the pages look the same without having to change everything over and over again.
Try opening the Text-Files; they use tables, mate.
Anyway, I found a missing page and fixed it:
http://www.totse.info/en/technology/science_technology/index.html
I will review other pages as well. Currently we're working on making new stuff.
Still not working for me. Here is a screenshot of what I get when I click a link on the front page.
http://img856.imageshack.us/img856/4043/110419231923.jpg
Same for me.
EDIT: Fixed
Nope, still broke.
http://www.totse.info/en/
Try now. It seems the webpage is using some sort of caching, and I am having trouble seeing any changes made to the files. Currently looking into this matter.
Yep, working really well.
Love the old ads as well, hope you're getting paid for them.
* = http://tot... http://www.tot....
Also the top shortcut bar has bbs on some and forums on others. <<Not really high priority
Sorry to be bringing this all up, I spend my day poking holes in things like this so it's become second nature.
EDIT2: Search is doing the same stupid page/test thing
We're not. The Text-files section is a mirror. I might change it into something later, but it would require editing 50,000+ pages.
The Totse Text-files are just a mirror at the moment. We left it like that long ago. It's pointless to edit it and make corrections. It's best to focus on this http://www.totse.info/cms/index.php and make new content. The old content is ancient and already indexed.
I know what you're trying to do but we tried it before. The HTML structure is just too fucked up for any easy editing and we can't merge data to CMS because it will create more and more problems.
I am willing to correct index issues but the highlight of Totse.info is the forums and the new articles.
Ah ok, didn't realise it was a mirror.
If you ever want to pull the text out of the html files you should google symbol soup
Thanks, will do.
When I googled Symbol Soup, all I came up with were some books for sale on Amazon. I have to think you are referring to something else.
http://www.givegoodweb.com/post/38/beautiful-soup-example-1
It can be done, I am sure in future we might see something more classic but as it stands the old layout doesn't work for the modern web standards. We're aiming at rebuilding Totse and we need to keep up with the current standards. Plus, Totse.info should have a unique identity as well.
Like erotica...
http://www.totse.info/en/erotica/default.htm
The hot topics are old... like 'my dad's a pornstar',
'does taking a break ever work',
and others... Fix it NOW!
Can't, or maybe I can. But honestly it won't work because I would have to edit all the pages.
EDIT: I edited them but I can't insert the current code there.
I wonder who did it?
Yeah, I was tired. I meant Beautiful Soup.
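For anyone who wants to try pulling the text out of the old table-based pages: with Beautiful Soup installed, `BeautifulSoup(html, "html.parser").get_text()` is the usual one-liner. Below is a rough sketch of the same idea using only Python's standard library `html.parser`, so it is not Beautiful Soup itself. `TextExtractor` and `html_to_text` are names made up for this example, and the sample markup is invented rather than taken from a real Totse page.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, ignoring the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not inside a skipped element.
        if self._skip_depth == 0 and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return "\n".join(self._chunks)


def html_to_text(html: str) -> str:
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


# Invented sample markup, roughly in the spirit of a table-based page.
sample = "<html><body><table><tr><td>Title</td><td>Body text</td></tr></table></body></html>"
print(html_to_text(sample))
```

Each run of text between tags ends up on its own line, so table cells come out as separate lines. Pages with badly broken markup may need the more forgiving parsing that Beautiful Soup offers.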
What format would the pages have to be in for an easy batch import into the cms?
Can't import them; they're already on other mirrors. They don't really do us any good.