NUTCH-427

protocol-smb: protocol plugin implementing the CIFS/SMB protocol. This plugin allows Nutch to crawl Microsoft Windows shares remotely using a CIFS/SMB protocol implementation.

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Won't Fix
    • Affects Version/s: 0.8.1, 0.9.0, 1.0.0
    • Fix Version/s: 1.7, 2.2
    • Component/s: fetcher
    • Labels:
      None
    • Environment:

      JAVA - OS independent

    • Patch Info:
      Patch Available

      Description

      Title: protocol-smb - Nutch protocol plugin for crawling Microsoft Windows shares
      Author: Armel T. Nene
      Update: Vadim Bauer
      Email: armel.nene NOSPAM-AT-NOSPAM idna-solutions.com, V a d i m B a u e r <AT> g m x . d e

      A. Introduction

      The protocol-smb plugin allows you to crawl Microsoft Windows shares. It implements
      the CIFS/SMB protocol, which is commonly used on Microsoft operating systems. The plugin replicates the
      behaviour of protocol-file over the CIFS/SMB protocol. It uses the JCIFS library and
      supports all of that library's properties.
      You can find more information on the following site: http://jcifs.samba.org/
      The SMB URL syntax for crawling is as follows: smb://xxxxx (for example, smb://server/share).
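
      To illustrate the URL syntax above, here is a minimal sketch that splits an
      smb:// URL into server, share and remaining path. It uses java.net.URI only for
      parsing, so it runs without JCIFS. The class name SmbUrlParts and its fields are
      hypothetical and are not part of the plugin.

```java
import java.net.URI;

// Hypothetical helper, not part of protocol-smb: splits smb://server/share/path.
final class SmbUrlParts {
    final String server;
    final String share;
    final String path;

    private SmbUrlParts(String server, String share, String path) {
        this.server = server;
        this.share = share;
        this.path = path;
    }

    static SmbUrlParts parse(String url) {
        // URI (unlike URL) does not need a registered protocol handler.
        URI uri = URI.create(url);
        if (!"smb".equalsIgnoreCase(uri.getScheme())) {
            throw new IllegalArgumentException("Not an smb:// URL: " + url);
        }
        String host = uri.getHost();
        if (host == null) {
            throw new IllegalArgumentException("Missing server name: " + url);
        }
        String raw = uri.getPath() == null ? "" : uri.getPath();
        String trimmed = raw.startsWith("/") ? raw.substring(1) : raw;
        // The first path segment is the share; everything after it is the path.
        int slash = trimmed.indexOf('/');
        String share = slash < 0 ? trimmed : trimmed.substring(0, slash);
        String rest = slash < 0 ? "" : trimmed.substring(slash + 1);
        return new SmbUrlParts(host, share, rest);
    }

    public static void main(String[] args) {
        SmbUrlParts parts = parse("smb://server/share/docs/report.pdf");
        System.out.println(parts.server + " | " + parts.share + " | " + parts.path);
    }
}
```

      Note that URI.create rejects unencoded characters such as spaces, so paths
      containing them would need to be URL-encoded before parsing.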

      B. Installation

      1) Binaries only: The protocol-smb files can be found in the ../plugins directory.
      Copy the "protocol-smb" folder to the NUTCHHOME/build/plugins directory.
      Put the "smb.properties" file in the NUTCHHOME/conf directory.
      Configure the properties in the "smb.properties" file.
      Enable the plugin by updating the "nutch-site.xml" file found in the NUTCHHOME/conf directory,
      e.g.
      <property>
        <name>plugin.includes</name>
        <value>protocol-smb| other plugins...</value>
        <description>
        </description>
      </property>

      2) Source code: The protocol-smb sources can be found in the ../src directory.
      Always refer to the Nutch wiki for detailed instructions on building Nutch. In short:
      Copy the 'protocol-smb' folder to NUTCHHOME/src/plugin.
      Update the build.xml in NUTCHHOME/src/plugin to include the plugin.
      Update the NUTCHHOME/default.properties file to include the plugin.
      Run ant to build.
      Copy the 'smb.properties' file to NUTCHHOME/conf, and configure the properties.
      Enable the plugin by updating the nutch-site.xml file.
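
      Both installation routes end with editing plugin.includes. The sketch below
      assumes that Nutch treats this value as a regular expression matched against
      each plugin id, as the stock nutch-default.xml describes it. Under that
      assumption, stray whitespace around the "|" separators becomes part of the
      pattern. The class name and the sample plugin ids are illustrative only.

```java
import java.util.regex.Pattern;

// Illustrative only: mimics matching a plugin id against a plugin.includes regex.
final class PluginIncludesCheck {

    // Returns true when the whole plugin id matches the plugin.includes pattern.
    static boolean isIncluded(String pluginIncludes, String pluginId) {
        return Pattern.compile(pluginIncludes).matcher(pluginId).matches();
    }

    public static void main(String[] args) {
        // Sample values; substitute the plugins you actually use.
        String clean = "protocol-smb|urlfilter-regex|parse-(html|tika)";
        String spaced = "protocol-smb| urlfilter-regex";

        System.out.println(isIncluded(clean, "protocol-smb"));
        System.out.println(isIncluded(clean, "parse-tika"));
        // The space after '|' is part of the second alternative here.
        System.out.println(isIncluded(spaced, "urlfilter-regex"));
    }
}
```

      If the assumption holds, it is safer to write the value without spaces around
      the separators.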

      C. Known Issues

      1) MalformedURLException: unknown protocol: smb

      The SMB URL protocol handler is not being successfully installed.
      In short, the JCIFS jar must be loaded by the system class loader.

      Workaround: a) As a short-term solution, install the JCIFS jar
      found in the protocol-smb folder into
      JDKHOME/jre/lib/ext and/or JREHOME/lib/ext.

      b) If the exception is still thrown after completing step a),
      set the system property by passing the following argument
      to the JVM:

      -Djava.protocol.handler.pkgs=jcifs

      c) You can also set the property in your code. For example, if
      you start crawling with org.apache.nutch.crawl.Crawl,
      add the following two lines at the start of main. This has the same effect as b).
      public static void main(String args[]) throws Exception {
          System.setProperty("java.protocol.handler.pkgs", "jcifs");
          new java.util.PropertyPermission("java.protocol.handler.pkgs", "read, write");
          // and so on
      }

      See also the FAQ page: http://jcifs.samba.org/src/docs/faq.html
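
      To check whether a handler for a given scheme is visible to the JVM, a small
      probe such as the following may help. It is a sketch only: the class name
      ProtocolHandlerCheck is hypothetical, and it relies on java.net.URL throwing
      MalformedURLException when no handler is registered for the scheme. Run it
      with and without -Djava.protocol.handler.pkgs=jcifs (and the JCIFS jar on
      the class path) to compare the results.

```java
import java.net.MalformedURLException;
import java.net.URL;

// Hypothetical probe, not part of protocol-smb.
final class ProtocolHandlerCheck {

    // Returns true when java.net.URL accepts the URL string, which requires a
    // registered handler for its scheme. Other parse errors also return false.
    static boolean hasHandler(String spec) {
        try {
            new URL(spec);
            return true;
        } catch (MalformedURLException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String spec = args.length > 0 ? args[0] : "smb://server/share";
        System.out.println(spec + " -> handler available: " + hasHandler(spec));
    }
}
```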

      2) FATAL smb.SMB - Could not read content of protocol: smb://xxxxxx

      This problem usually occurs if the following properties are not set correctly in
      the "smb.properties" file:

      • username
      • password
      • domain

      See also the following resources for the list of
      available properties and how to set them:

      http://jcifs.samba.org/src/docs/api/overview-summary.html#scp
      FAQ page: http://jcifs.samba.org/src/docs/faq.html

      N.B. All properties should be set in the "smb.properties" file. You can set
      all supported JCIFS properties in the "smb.properties" file.
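
      A quick sanity check of the credentials can be sketched as follows. It assumes
      that "smb.properties" uses the standard JCIFS client property names
      (jcifs.smb.client.username, jcifs.smb.client.password, jcifs.smb.client.domain);
      verify the key names against the file shipped with the plugin. The class name
      SmbPropertiesCheck is hypothetical.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Hypothetical helper: reports credential keys that are absent or blank.
final class SmbPropertiesCheck {

    // Assumed key names, taken from the JCIFS client properties.
    static final String[] REQUIRED = {
        "jcifs.smb.client.username",
        "jcifs.smb.client.password",
        "jcifs.smb.client.domain"
    };

    // Parses properties-file text and returns the required keys with no value.
    static List<String> missingKeys(String propertiesText) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(propertiesText));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        List<String> missing = new ArrayList<>();
        for (String key : REQUIRED) {
            String value = props.getProperty(key);
            if (value == null || value.trim().isEmpty()) {
                missing.add(key);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Sample content with the password left out on purpose.
        String sample = "jcifs.smb.client.username=crawler\n"
                + "jcifs.smb.client.domain=EXAMPLE\n";
        System.out.println("Missing: " + missingKeys(sample));
    }
}
```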

      3) Only tested on Windows XP and Windows Server 2003. Please report test
      results on other operating systems.

      1. protocol-smb.zip
        636 kB
        Armel Nene
      2. protocol-smb.zip
        649 kB
        Vadimo
      3. protocol-smb-diff.txt
        16 kB
        Ilguiz Latypov
      4. protocol-smb-dist.zip
        737 kB
        Ilguiz Latypov

          Activity

          tejasp Tejas Patil added a comment -

          The patch uses JCIFS, which is licensed under the LGPL, so it cannot be included in the Nutch distribution. Marking this "Won't Fix".

          tejasp Tejas Patil added a comment -

          As Andrzej Bialecki mentioned earlier "This plugin uses an LGPL library, which cannot be included in Nutch repository". So this patch need not be cooked with our binaries. If anyone wants to use this, they can use the patch above. Unless there are any objections, I would prefer to mark it as wont-fix and close it.

          xpow Soewandi added a comment -

          Hello,

          I've tried to use the protocol-smb plugin with Nutch. Nutch read and parsed the documents correctly, but afterwards, during the CrawlDb update, crawl.CrawlDbReducer logged many warnings like 'crawl.CrawlDbReducer - Missing fetch and old value, signature=[B@34d0cdd0', and as a result no documents get indexed with Solr ...

          Can anyone help me pinpoint what is going on?

          Thanks

          Here's the log file:
          2012-08-29 13:54:52,641 INFO parse.ParseSegment - Parsing: smb://192.168.3.6/share/putusan/putusan_sidang_PUTUSAN 48-2011 TELAH baca.pdf
          2012-08-29 13:54:53,576 INFO parse.ParseSegment - Parsing: smb://192.168.3.6/share/putusan/putusan_sidang_Putusan 55 PUU-2010-TELAH BACA.pdf
          2012-08-29 13:54:53,612 INFO parse.ParseSegment - Parsing: smb://192.168.3.6/share/putusan/putusan_sidang_Putusan Sela 108 PHPU 2011.pdf
          2012-08-29 13:54:53,930 INFO regex.RegexURLNormalizer - can't find rules for scope 'outlink', using default
          2012-08-29 13:54:55,087 INFO parse.ParseSegment - ParseSegment: finished at 2012-08-29 13:54:55, elapsed: 00:00:28
          2012-08-29 13:54:55,103 INFO crawl.CrawlDb - CrawlDb update: starting at 2012-08-29 13:54:55
          2012-08-29 13:54:55,103 INFO crawl.CrawlDb - CrawlDb update: db: crawl/crawldb
          2012-08-29 13:54:55,103 INFO crawl.CrawlDb - CrawlDb update: segments: [crawl/segments/20120829134849]
          2012-08-29 13:54:55,103 INFO crawl.CrawlDb - CrawlDb update: additions allowed: true
          2012-08-29 13:54:55,103 INFO crawl.CrawlDb - CrawlDb update: URL normalizing: true
          2012-08-29 13:54:55,103 INFO crawl.CrawlDb - CrawlDb update: URL filtering: true
          2012-08-29 13:54:55,103 INFO crawl.CrawlDb - CrawlDb update: 404 purging: false
          2012-08-29 13:54:55,104 INFO crawl.CrawlDb - CrawlDb update: Merging segment data into db.
          2012-08-29 13:54:55,584 INFO regex.RegexURLNormalizer - can't find rules for scope 'crawldb', using default
          2012-08-29 13:54:55,765 INFO regex.RegexURLNormalizer - can't find rules for scope 'crawldb', using default
          2012-08-29 13:54:56,121 INFO regex.RegexURLNormalizer - can't find rules for scope 'crawldb', using default
          2012-08-29 13:54:56,160 INFO crawl.FetchScheduleFactory - Using FetchSchedule impl: org.apache.nutch.crawl.DefaultFetchSchedule
          2012-08-29 13:54:56,160 INFO crawl.AbstractFetchSchedule - defaultInterval=2592000
          2012-08-29 13:54:56,160 INFO crawl.AbstractFetchSchedule - maxInterval=7776000
          2012-08-29 13:54:56,198 WARN crawl.CrawlDbReducer - Missing fetch and old value, signature=[B@34d0cdd0
          2012-08-29 13:54:56,199 WARN crawl.CrawlDbReducer - Missing fetch and old value, signature=[B@78782dc6
          2012-08-29 13:54:56,199 WARN crawl.CrawlDbReducer - Missing fetch and old value, signature=[B@1a055ff4

          ilatypov Ilguiz Latypov added a comment -

          Applied my diff to simplify importing into the Subversion tree. The build directory should not be imported, and the src/plugin/build.xml file should only add the new protocol-smb deploy and clean targets.

          The previous author did not grant the license to ASF.

          ilatypov Ilguiz Latypov added a comment -

          I hesitate to add the .zip file because (a) it hides the intention of the change and (b) other developers who might have already modified their copies would have difficulty merging my change.

          I believe the GNU patch tool will apply my suggested change automatically, provided that one resides in the right working directory and, possibly, applies the -pX option where X is the number of upper level directory names to ignore in the patch.

          vincent.couturier Vincent Couturier added a comment -

          The last attached zip does not contain Ilguiz Latypov's changes. It is necessary to patch the zip with protocol-smb-diff.txt. I will try to upload a patched version, but it would be easier if Ilguiz could upload his updated version.

          ilatypov Ilguiz Latypov added a comment - - edited

          Fixed reading of SMB files, updated to jcifs 1.3.0, enhanced the smoke
          test app. Protected special characters such as apostrophe and hash
          mark with URL encoding.

          Fixed the infinite retry loop in SMB.java.

          Tried but could not activate the Apache logging.

          jhurley12 Joe Hurley added a comment -

          Is there a reason why this plugin only handles directories? I had to make the following changes to enable file crawling:

          in SMBResponse.java:
          replace `byte[] byte` with `this.content` on line 163
          remove lines 206 and 209

          Also, it got stuck in the file-not-found case. After examining the protocol-file code, I moved the else statement in SMB.java (lines 76 and 77) outside of the curly bracket on line 78. After this change, the code continues after encountering a missing file rather than looping forever.

          And since then, it seems to work nicely on Windows Vista. Thanks for the plugin!

          vadimbauer Vadimo added a comment -

          The update fixes some issues I had with the old version when trying to use it with Nutch 1.0-dev.

          vadimbauer Vadimo added a comment -

          This is an update to the previous version. Check the included readme.txt.

          vadimbauer Vadimo added a comment -

          There is an error in the plugin.xml file:

          the plugin id should be protocol-smb and not protocol-file!

          <?xml version="1.0" encoding="UTF-8" ?>
          <!-- Document : plugin.xml
               Created on : 03 January 2007, 10:41
               Author : Armel T. Nene
               Description:
               This file is used by Nutch to configure the SMB protocol
          -->
          <plugin id="protocol-smb" name="SMB Protocol Plug-in" version="1.0.0" provider-name="iDNA Solutions LTD">
            <runtime>
              <library name="protocol-smb.jar">
                <export name="*" />
              </library>
              <library name="jcifs-1.2.12.jar" />
            </runtime>
            <requires>
              <import plugin="nutch-extensionpoints" />
            </requires>
            <extension id="org.apache.nutch.protocol.smb" name="SMBProtocol" point="org.apache.nutch.protocol.Protocol">
              <implementation id="org.apache.nutch.protocol.smb.SMB" class="org.apache.nutch.protocol.smb.SMB">
                <parameter name="protocolName" value="SMB" />
              </implementation>
            </extension>
          </plugin>

          ab Andrzej Bialecki added a comment -

          New features are not critical. This plugin uses an LGPL library, which cannot be included in Nutch repository.

          armel Armel Nene added a comment -

          The best way is to make the plugin available on plugin central, so that
          people who need the plugin can download it from there.

          ab Andrzej Bialecki added a comment -

          JCIFS is licensed under LGPL, so it cannot be included in Nutch distribution. As a consequence, we could add this plugin but it wouldn't be a part of the regular build ...


            People

            • Assignee:
              Unassigned
              Reporter:
              armel Armel Nene
            • Votes:
              0
              Watchers:
              5
