Comments on Google Operating System: Export Files from Google Page Creator
Feed author: Alex Chitu. Last updated: 2024-03-18T02:14:57.204-07:00.

Unknown, 2009-05-19T05:07:00.000-07:00:
A much easier way is posted here: http://www.ialwayscapital.com/2009/05/exportbackup-all-google-pages-files-in.html

Scot, 2008-10-23T05:11:00.000-07:00:
Thanks for the tips, Alex!

I'm just trying to export a single page that uses the default settings (nothing custom, just some text and a few pictures with the out-of-the-box settings). The text formatting comes out great, but the background changes and I lose my borders when I move the files to a different server. Do you know if Page Creator calls outside CSS files or if it uses some type of JavaScript for formatting?

Lord of Light AZ, 2008-08-18T07:56:00.000-07:00:
I used HTTrack to get my pages: http://www.httrack.com/page/2/en/index.html
It might get more than you want, but it does get all published pages and links.

Gotchas to watch for:
Character codes that GPC may have put in your script.
You probably lost any comments that a script had when you pasted it into GPC.
Don't forget that the templates have special files that are included in your code, seen in the "style" section as url(-/include/...).
Remember, if you hacked GPC with a script to change the style with your custom style sheet, you need to either link in the stylesheet or add the code to the HTML document. I recommend linking it in after the "style" section.
If the place you are moving to supports directories, think about fixing your HTML code and moving files into a directory structure.
Some places don't support filenames without extensions; you're going to need to add .htm or .html to your files.

Alex Chitu, 2008-08-18T03:45:00.000-07:00:
Here's some even better code for Unix. Paste this into a text editor, save the file as exportgpc, execute

    chmod +x exportgpc

and then run:

    ./exportgpc sitename

OR

    ./exportgpc sitename.googlepages.com

OR

    ./exportgpc http://sitename.googlepages.com

All the files are downloaded to a new directory named sitename.googlepages.com.

The code:

    #!/bin/bash
    # Exports the files hosted on a Google Page Creator site.

    url=$1
    # Print usage and stop if no site name was given.
    [ $# -eq 0 ] || [ -z "${url}" ] &&
    echo -e "exportgpc: missing URL\nUsage: exportgpc SITENAME.\nFor example, use exportgpc sundayclub if the URL is http://sundayclub.googlepages.com." && exit 1
    # Reduce the argument to the bare site name.
    url=${url/http:\/\//}
    url=${url/.googlepages.com\//}
    url=${url/.googlepages.com/}
    # Fetch the sitemap, extract every <loc> URL, then download the listed files.
    wget "http://${url}.googlepages.com/sitemap.xml" -P "${url}.googlepages.com" -N
    grep -E -o "<loc>.+</loc>" "${url}.googlepages.com/sitemap.xml" | sed -e "s/<\/loc>//" -e "s/<loc>//" >links.txt
    wget -i links.txt -P "${url}.googlepages.com" -N
    rm -f links.txt

Alex Chitu, 2008-08-18T00:47:00.000-07:00:
@corey:

This code should work on Unix, and it only needs the address of your site.

    wget "http://site.googlepages.com/sitemap.xml"
    grep -E -o "<loc>.+</loc>" sitemap.xml | sed -e "s/<\/loc>//" -e "s/<loc>//" >links.txt
    wget -i links.txt -P site.googlepages.com

Anonymous, 2008-08-17T23:57:00.000-07:00:
Or, if you're on Unix:
just create a blank file
paste in all of the links
if you're using vim, just type:

    :1,$s/^/wget /

and then make it executable: chmod +x filenamehere
and then run it: ./filenamehere

Mille, 2008-08-17T07:40:00.000-07:00:
Another method is to use a web crawler that crawls through all links and creates an (optional) offline version of the page.

Teleport Pro
http://www.tenmax.com/teleport/pro/home.htm

HTTrack
http://www.httrack.com/

Anonymous, 2008-08-16T14:11:00.000-07:00:
I would like to suggest that you visit this page:
http://gilles.rasigade.googlepages.com/View.htm

It can display most of the Google Page Creator files that have been added to sitemap.xml.

Regards,
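
For anyone adapting the sitemap approach from the scripts above, here is a rough sketch of the two text-handling steps (reducing the argument to a site name, and pulling the `<loc>` URLs out of sitemap.xml) in plain POSIX sh. The function names site_name and sitemap_urls are hypothetical and not part of the original scripts. Splitting on "<" before matching is a swapped-in technique, assumed here so that a sitemap with several `<loc>` elements on one line still yields one URL per line.

```shell
#!/bin/sh
# Sketch only: site_name and sitemap_urls are hypothetical helper names.

# Reduce "http://name.googlepages.com/", "name.googlepages.com" or "name"
# to the bare site name "name".
site_name() {
    name=$1
    name=${name#http://}            # drop a leading scheme, if present
    name=${name%/}                  # drop one trailing slash, if present
    name=${name%.googlepages.com}   # drop the domain suffix, if present
    printf '%s\n' "$name"
}

# Print every <loc> URL found in the sitemap file given as $1, one per line.
# Each "<" becomes a newline, so every element starts its own line and the
# URL is whatever follows "loc>" on that line.
sitemap_urls() {
    tr '<' '\n' < "$1" | sed -n 's/^loc>//p'
}
```

With GNU wget, the output of sitemap_urls can presumably be piped straight into the download step, for example `sitemap_urls sitemap.xml | wget -N -i - -P site.googlepages.com`, which avoids the temporary links.txt file.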