OFBiz / OFBIZ-630

Create Google SiteMap for Ecommerce component.

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: Trunk
    • Labels:
      None

      Description

      A tool for creating a sitemap for the ecommerce component. The sitemap should comply with version 0.9 (or newer) of the protocol published at http://www.sitemap.org.

      Ideas and recommendations for implementation will help.

        Activity

        Hans Bakker added a comment - edited

        We have created an OFBiz component that does just that: it creates a sitemap.xml for one or more websites in the system. It also takes advantage of the SEO-friendly URLs if they are present. URLs are generated for products, categories, content and static web pages.
        More info at http://www.antwebsystems.com/control/ofbiz-products
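
For readers who want a feel for what such a generator has to emit, here is a minimal sketch in Python. It is not the component mentioned above (whose code is not shown in this thread); the function name `build_sitemap` and the example URLs are assumptions made for illustration. It only shows the 0.9 `urlset` / `url` / `loc` / `lastmod` structure.

```python
import xml.etree.ElementTree as ET

# Namespace of the sitemap 0.9 protocol (same value as in the urlset quoted in this issue).
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return sitemap XML as UTF-8 bytes for an iterable of (loc, lastmod) pairs.

    lastmod may be None, in which case the <lastmod> element is omitted.
    build_sitemap is a hypothetical helper, not part of OFBiz.
    """
    # Emit the sitemap namespace as the default namespace (no prefix).
    ET.register_namespace("", SITEMAP_NS)
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for loc, lastmod in urls:
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = loc
        if lastmod is not None:
            ET.SubElement(url, f"{{{SITEMAP_NS}}}lastmod").text = lastmod
    # xml_declaration requires Python 3.8 or newer.
    return ET.tostring(urlset, encoding="utf-8", xml_declaration=True)


if __name__ == "__main__":
    # Hypothetical URLs; a real generator would read products, categories, etc. from the system.
    pages = [
        ("http://www.mysite.com/", None),
        ("http://www.mysite.com/products/example-product", "2007-05-05"),
    ]
    print(build_sitemap(pages).decode("utf-8"))
```

In a real component the list of pages would come from the product, category and content data rather than a hard-coded list.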

        Jacques Le Roux added a comment -

        OK, I found the reason (I'm rather new to Apache and the Tomcat connector). I had to put the file in the "ServerAdmin" directory (the sitemap.xml file is not related to Java).

        From my investigation, I don't think that a specific tool is needed for OFBiz eCommerce. There are plenty of tools; you just have to choose the ones you prefer.

        Here are the links I prefer so far:
        Online sitemap generator: http://www.auditmypc.com/free-sitemap-generator.asp
        Sitemap validator: http://www.smart-it-consulting.com/internet/google/submit-validate-sitemap/index.htm
        Sitemap editor: http://www.sitemapdoc.com/EditMap.aspx
        And last but not least: https://www.google.com/webmasters/tools

        I am closing this issue. Feel free to reopen it, or ask if needed.

        Jacques Le Roux added a comment -

        Something I forgot: the smarter third-party validators I used report an (incomprehensible) error:
        "Data at the root level is invalid. Line 1, position 1."

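A message of this kind usually means the parser met something other than XML at the very first character of the response, for example an HTML page, an empty body or a byte order mark. That is only a plausible reading, not something confirmed in this thread. The sketch below, with the hypothetical helper `diagnose_sitemap_bytes`, looks at the first bytes actually served at the sitemap URL and gives a rough guess.

```python
def diagnose_sitemap_bytes(data: bytes) -> str:
    """Give a rough guess at why an XML parser might reject `data` at line 1, position 1.

    Hypothetical helper for illustration; it inspects only the start of the payload.
    """
    if not data.strip():
        return "empty response"
    if data.startswith(b"\xef\xbb\xbf"):
        return "starts with a UTF-8 byte order mark"
    if data[:1].isspace():
        return "leading whitespace before the XML"
    head = data[:200].lower()
    if head.startswith(b"<!doctype html") or head.startswith(b"<html"):
        return "HTML page, not a sitemap"
    if not data.startswith(b"<"):
        return "does not start with '<'"
    return "starts like XML"


if __name__ == "__main__":
    # Example payloads; in practice `data` would be the bytes downloaded from the sitemap URL.
    print(diagnose_sitemap_bytes(b"<html><body>Login</body></html>"))
    print(diagnose_sitemap_bytes(b'<?xml version="1.0" encoding="UTF-8"?><urlset/>'))
```

If the URL is routed to the web application instead of the static file, the first case (an HTML page) is the one to look for.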
        Jacques Le Roux added a comment -

        I'm stuck on this one. I have used 2 tools:
        1. First GSiteCrawler, which runs locally on Windows (good, but it needs Windows and uses VB, so forget it) (found on the Google page: http://code.google.com/sm_thirdparty.html)
        2. Then an applet, really cool: http://www.auditmypc.com/free-sitemap-generator.asp (tool link is http://www.auditmypc.com/xml-sitemap.asp). The tool is not ASP; only the site is (I found it myself).

        I have generated a sitemap.xml file and checked it in Eclipse using Oxygen against http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd. I used this urlset element:
        <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
        xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9
        http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd">
        It is compatible with Google, Yahoo and MSN (see http://www.sitemap.org).
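
As a lighter complement to the schema check described above, a structural check can be sketched with the Python standard library. This is not XSD validation. It only confirms that the document is well-formed, that the root is `urlset` in the 0.9 namespace, and that every `url` entry has a `loc`. The helper name `check_sitemap` is an assumption made for this example.

```python
import xml.etree.ElementTree as ET

# Namespace taken from the urlset element quoted above.
NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def check_sitemap(xml_text):
    """Return a list of problems found; an empty list means the basic structure looks right.

    Hypothetical helper: a structural sanity check, not a substitute for XSD validation.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        return [f"not well-formed XML: {exc}"]
    problems = []
    if root.tag != f"{{{NS}}}urlset":
        problems.append(f"unexpected root element: {root.tag}")
    for i, url in enumerate(root.findall(f"{{{NS}}}url"), start=1):
        loc = url.find(f"{{{NS}}}loc")
        if loc is None or not (loc.text or "").strip():
            problems.append(f"url entry {i} has no <loc>")
    return problems


if __name__ == "__main__":
    sample = (
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
        "<url><loc>http://www.mysite.com/</loc></url>"
        "</urlset>"
    )
    print(check_sitemap(sample))
```

Running the same check on the bytes actually served at the public URL, rather than on the local file, shows whether the two differ.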

        So all was smooth until I wanted to submit it in Google tools (https://www.google.com/webmasters/tools/). I had to submit it with control/main/ added after the real site domain name (e.g. http://www.mysite.com/control/main/sitemap.xml) because, even though the file is in the OFBiz root, you need to do so when using Apache and the Tomcat connector (AJP13), at least in my case. Then I got stuck: Google keeps saying that my file does not have the right format (I tried many ways).

        Does anybody have an idea why Google does not want my file (which is correct for sure)? Is it an issue related to VirtualHost and JkMount? I have a really simple one:
        <VirtualHost *:80>
        ServerAdmin webmaster@mysite
        ServerName mysite
        JkMount / ofbizServer
        JkMount /control ofbizServer
        JkMount /control/* ofbizServer

        #<LocationMatch "/control">
        <LocationMatch "/">
        # list of the banned JLR 5/5/7 (did not work not sure why => use known IP addresses)
        #SetEnvIfNoCase User-Agent "^TMCrawler" banned
        # deny them access
        #deny from env=banned
        deny from 128.241.20. #tmcrawler bot
        </LocationMatch>
        </VirtualHost>

        I suspect a problem there because I was not able to validate my sitemap.xml with third-party sitemap validators (but the errors were incomprehensible).

        Following the sitemap.org advice, I have also put a line in the robots.txt file (in the OFBiz root), hoping it will work (fingers crossed).

        Thanks
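
The robots.txt line mentioned above is the `Sitemap:` directive. As a small sketch, the Python standard library can read it back to confirm the declaration is picked up. The robots.txt body, the URL in it and the helper name `sitemaps_declared` are assumptions for illustration, and `site_maps()` needs Python 3.8 or newer.

```python
from urllib.robotparser import RobotFileParser


def sitemaps_declared(robots_txt):
    """Return the sitemap URLs declared in a robots.txt body (empty list if none).

    Hypothetical helper built on the standard-library robots.txt parser.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap line is present.
    return parser.site_maps() or []


# Hypothetical robots.txt content; the real file and URL depend on the site.
EXAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow:

Sitemap: http://www.mysite.com/sitemap.xml
"""

if __name__ == "__main__":
    print(sitemaps_declared(EXAMPLE_ROBOTS_TXT))
```

The URL on the `Sitemap:` line should be the one where the file is really served, which in the setup described above may not be the plain site root.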


          People

          • Assignee:
            Jacques Le Roux
            Reporter:
            Anil K Patel
          • Votes:
            2
            Watchers:
            1
