Topic
  • 4 replies
  • Latest Post - ‏2011-11-01T18:42:33Z by Santiago Montico
Santiago Montico

Pinned topic Built-in features

2011-10-29T22:22:37Z
Hi,

I'm creating a common analysis structure (CAS) to database mapping to load the analysis results of a UNIX folder crawler into a DB2 database.
I'm using OmniFind Enterprise Edition V9.1.
I have created the following XML to populate the database:

<?xml version="1.0" encoding="UTF-8"?>
<ns1:cas2JdbcConfiguration xmlns:ns1="http://www.ibm.com/uima/consumer/cas2jdbc/100/xml"
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:schemaLocation="http://www.ibm.com/uima/consumer/cas2jdbc/100/xml file:/C:/AnnotatorsOmnifind/examples/descriptors/cas_consumer/CasToJDBCMapping.xsd">
  <databaseConnection>
    <connectionUrl>db2://XXXXX:XXXXX/XXXXX</connectionUrl>
    <driver type="jdbc">com.ibm.db2.jcc.DB2Driver</driver>
    <driverLibraries>
      <driverLibrary>XXX/db2jcc.jar</driverLibrary>
    </driverLibraries>
    <authentication>
      <username>XXXX</username>
      <password>XXXX</password>
    </authentication>
  </databaseConnection>
  <cas2JdbcMappingSpec>
    <cas2JdbcMappings>
      <explicitMappings>
        <explicitMappingRule applyToSubtypes="false">
          <type>uima.tcas.DocumentAnnotation</type>
          <table>POC_SCHEMA.DOCUMENTS_PART1</table>
          <featureMappings>
            <featureMapping>
              <feature>coveredText()</feature>
              <length>200</length>
              <column>TEXT</column>
            </featureMapping>
          </featureMappings>
        </explicitMappingRule>
      </explicitMappings>
    </cas2JdbcMappings>
  </cas2JdbcMappingSpec>
</ns1:cas2JdbcConfiguration>
I created an example collection and then an example crawler associated with that collection.
After that I uploaded the database mapping XML without errors, but nothing happens. The crawler starts and crawls a directory, but no data is inserted into the DB2 database, and nothing appears in the log of this collection. What could be wrong?
I would also like to know which types and features come built into the product. I have only found a few examples in the Information Center, and I don't know where to find the complete list.

For example, is com.ibm.omnifind.types.PoliceReport a built-in OmniFind type that can be used in the database mapping?

<explicitMappingRule applyToSubtypes="false">
  <type>com.ibm.omnifind.types.PoliceReport</type>
  <table>sample.policeReport</table>
  <featureMappings>
    <featureMapping>
      <feature>uniqueId()</feature>
      <column>policeReportId</column>
    </featureMapping>
    <featureMapping>
      <feature>location/uniqueId()</feature>
      <column>crimeLocationId</column>
    </featureMapping>
  </featureMappings>
  <filter syntax="FeatureValue">location/coveredText()="Los Angeles"</filter>
</explicitMappingRule>
Any help would be appreciated.
Thanks in advance, Santiago.
Updated on 2011-11-01T18:42:33Z by Santiago Montico
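One way to rule out a malformed mapping file before uploading it is to parse it locally and list what each rule maps. The following is a minimal sketch in Python (standard library only) and is not part of OmniFind. The names `summarize_mapping` and `SAMPLE` are illustrative, and the sketch assumes that only the root element carries a namespace prefix, as in the file posted above. It checks well-formedness only; it does not validate against CasToJDBCMapping.xsd or contact the database.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the posted mapping, used only to demonstrate the function.
SAMPLE = """<ns1:cas2JdbcConfiguration xmlns:ns1="http://www.ibm.com/uima/consumer/cas2jdbc/100/xml">
  <cas2JdbcMappingSpec><cas2JdbcMappings><explicitMappings>
    <explicitMappingRule applyToSubtypes="false">
      <type>uima.tcas.DocumentAnnotation</type>
      <table>POC_SCHEMA.DOCUMENTS_PART1</table>
      <featureMappings>
        <featureMapping>
          <feature>coveredText()</feature>
          <length>200</length>
          <column>TEXT</column>
        </featureMapping>
      </featureMappings>
    </explicitMappingRule>
  </explicitMappings></cas2JdbcMappings></cas2JdbcMappingSpec>
</ns1:cas2JdbcConfiguration>"""


def summarize_mapping(xml_text):
    """Return one (type, table, feature, column, length) tuple per featureMapping."""
    # ET.fromstring raises ET.ParseError if the text is not well-formed XML.
    root = ET.fromstring(xml_text)
    rows = []
    # Assumption: child elements are unqualified (no namespace), so they can
    # be looked up by their plain names.
    for rule in root.iter("explicitMappingRule"):
        cas_type = rule.findtext("type", default="").strip()
        table = rule.findtext("table", default="").strip()
        for mapping in rule.iter("featureMapping"):
            feature = mapping.findtext("feature", default="").strip()
            column = mapping.findtext("column", default="").strip()
            # <length> is optional in the posted examples; None if absent.
            length = (mapping.findtext("length") or "").strip() or None
            rows.append((cas_type, table, feature, column, length))
    return rows


for row in summarize_mapping(SAMPLE):
    print(row)
```

The listed lengths can then be compared by hand with the column definitions in the target table.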
  • Santiago Montico

    Re: Built-in features

    ‏2011-10-31T15:47:01Z  
    Hi,

    In the index analysis section, the following exception is thrown for every document I try to crawl:

    FFQC5802I The document could not be analyzed or indexed because of errors. An exception was caught with the detail 'java.lang.NullPointerException' and a stack trace of 'org.apache.uima.resource.metadata.impl.ResourceMetaData_impl.validateConfigurationParameterDataTypeMatch(ResourceMetaData_impl.java:308)

    What could be wrong? Are the mapped data types of my features wrong?

    <featureMapping>
      <feature>docUri()</feature>
      <length>200</length>
      <column>URI</column>
    </featureMapping>

    <featureMapping>
      <feature>coveredText()</feature>
      <length>200</length>
      <column>TEXT</column>
    </featureMapping>
    In DB2, the column data types are:

    URI - VARCHAR(200)
    TEXT - VARCHAR(200)

    I read in an article that, for version 8.5, the DB2 database could not run on the same machine as OmniFind, but I don't know whether that is mandatory in order to use the cas2jdbc consumer. Does this also apply to OmniFind Enterprise Edition V9.1?

    Any help would be appreciated.

    Thanks in advance, Santiago.
  • bfoyle

    Re: Built-in features

    ‏2011-11-01T06:21:05Z  
    I've tried a number of different paths of research and I think the best bet here would be to open a PMR if you can and get official assistance from support for this case.

    bf
  • Santiago Montico

    Re: Built-in features

    ‏2011-11-01T18:21:31Z  
    Hi bfoyle,

    Thanks for the reply. I will look into how to open a PMR, as you suggested.

    I'm lost here. I created a collection, then a crawler, and after that I uploaded the following XML configuration:

    <?xml version="1.0" encoding="UTF-8"?>
    <cas2jdbc:cas2JdbcConfiguration
        xmlns:cas2jdbc="http://www.ibm.com/uima/consumer/cas2jdbc/100/xml"
        xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
        xsi:schemaLocation="http://www.ibm.com/uima/consumer/cas2jdbc/100/xml CasToJDBCMapping.xsd">

      <databaseConnection>
        <connectionUrl>oracle:thin:@XXXXX:XXXXX:xe</connectionUrl>
        <driver type="jdbc">oracle.jdbc.driver.OracleDriver</driver>
        <driverLibraries>
          <driverLibrary>/opt/IBM/es/omnifind/drivers/classes12.jar</driverLibrary>
          <driverLibrary>/opt/IBM/es/omnifind/drivers/ojdbc14.jar</driverLibrary>
        </driverLibraries>
        <authentication>
          <username>XXXXX</username>
          <password>XXXXX</password>
        </authentication>
      </databaseConnection>

      <cas2JdbcMappingSpec>
        <cas2JdbcMappings>
          <explicitMappings>
            <explicitMappingRule applyToSubtypes="false">
              <type>uima.tcas.DocumentAnnotation</type>
              <table>NFEPACK.DOCUMENTS_PART1</table>
              <featureMappings>

                <featureMapping>
                  <feature>docUri()</feature>
                  <length>200</length>
                  <column>URI</column>
                </featureMapping>

                <featureMapping>
                  <feature>coveredText()</feature>
                  <length>200</length>
                  <column>TEXT</column>
                </featureMapping>

              </featureMappings>
            </explicitMappingRule>
          </explicitMappings>
        </cas2JdbcMappings>
      </cas2JdbcMappingSpec>

    </cas2jdbc:cas2JdbcConfiguration>

    After that I started the collection, the crawler, and so on, and the error shows up:
    FFQC5802I The document could not be analyzed or indexed because of errors. An exception was caught with the detail 'java.lang.NullPointerException' and a stack trace of
    'org.apache.uima.resource.metadata.impl.ResourceMetaData_impl.validateConfigurationParameterDataTypeMatch(ResourceMetaData_impl.java:308)
    org.apache.uima.resource.metadata.impl.ResourceMetaData_impl.validateConfigurationParameterSettings(ResourceMetaData_impl.java:284)
    org.apache.uima.resource.metadata.impl.ResourceMetaData_impl.validateConfigurationParameterSettings(ResourceMetaData_impl.java:237)
    org.apache.uima.resource.impl.ConfigurationManagerImplBase.createContext(ConfigurationManagerImplBase.java:103)
    org.apache.uima.resource.Resource_ImplBase.initialize(Resource_ImplBase.java:128) o
    Do you think I am missing something, or is the procedure correct?

    I tested with OmniFind Enterprise Edition V9.1 using Oracle 10g and DB2 9.1, and the same error is shown in both cases.

    Thanks a lot, Santiago.
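The frames in this stack trace (validateConfigurationParameterSettings, createContext, initialize) show that the failure happens while UIMA validates the configuration parameter settings of a descriptor during initialization, not while rows are written to the database. One possible cause, which is only a guess and is not confirmed anywhere in this thread, is a parameter setting that has no usable value. The sketch below, in Python with the standard library only, lists the `nameValuePair` entries of a UIMA descriptor whose value is missing or blank so they can be reviewed by hand. The function names and the `SAMPLE` descriptor (including `ParamA`, `ParamB`, `ParamC`) are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical descriptor fragment; the parameter names are made up.
SAMPLE = """<analysisEngineDescription xmlns="http://uima.apache.org/resourceSpecifier">
  <analysisEngineMetaData>
    <configurationParameterSettings>
      <nameValuePair>
        <name>ParamA</name>
        <value><string>ok</string></value>
      </nameValuePair>
      <nameValuePair>
        <name>ParamB</name>
        <value><string></string></value>
      </nameValuePair>
      <nameValuePair>
        <name>ParamC</name>
      </nameValuePair>
    </configurationParameterSettings>
  </analysisEngineMetaData>
</analysisEngineDescription>"""


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag."""
    return tag.rsplit("}", 1)[-1]


def child(elem, name):
    """Return the first direct child with the given local name, or None."""
    for candidate in elem:
        if local_name(candidate.tag) == name:
            return candidate
    return None


def settings_without_value(descriptor_xml):
    """Return names of nameValuePair entries whose value is missing or blank."""
    root = ET.fromstring(descriptor_xml)
    suspicious = []
    for elem in root.iter():
        if local_name(elem.tag) != "nameValuePair":
            continue
        name_el = child(elem, "name")
        name = (name_el.text or "").strip() if name_el is not None else "<unnamed>"
        value_el = child(elem, "value")
        # The typed element inside <value>, e.g. <string>, <integer>, <array>.
        typed = list(value_el) if value_el is not None else []
        if not typed:
            suspicious.append(name)  # no <value>, or <value> is empty
        elif local_name(typed[0].tag) != "array" and not (typed[0].text or "").strip():
            suspicious.append(name)  # scalar value with blank text
    return suspicious


print(settings_without_value(SAMPLE))
```

An entry flagged here is not necessarily wrong, since a blank string may be intended; the list is only a starting point for inspecting the descriptor.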
  • Santiago Montico

    Re: Built-in features

    ‏2011-11-01T18:42:33Z  
    Hi,

    Problem solved.
    Follow the steps I described in my last post. After that, in my case, I had to modify the following files:
    ./home/esadmin/esdata/master_config/col_37844.indexservice/specifiers/common/cas2jdbc_annotator.xml
    ./home/esadmin/esdata/config/col_37844.indexservice/specifiers/common/cas2jdbc_annotator.xml
    These are the files associated with the collection (col_37844 in my case) that was created to use the cas2jdbc mapping.
    I tested this with Oracle 10g and it works like a charm.

    I hope this helps someone else.

    Bye, Santiago
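The post does not say what was changed inside the two files, so the sketch below only covers the mechanical part: locating both copies of cas2jdbc_annotator.xml for a collection and backing them up before editing them by hand. It is written in Python with the standard library only. The function names are illustrative, and the directory layout (`master_config` and `config` under the data directory) is assumed from the two paths given above.

```python
from pathlib import Path
import shutil

# Relative location of the descriptor inside a collection's index service
# directory, taken from the paths quoted in the post.
DESCRIPTOR = Path("specifiers") / "common" / "cas2jdbc_annotator.xml"


def collection_descriptors(esdata_dir, collection_id):
    """Return the existing cas2jdbc_annotator.xml copies for one collection."""
    esdata = Path(esdata_dir)
    candidates = [
        esdata / area / f"{collection_id}.indexservice" / DESCRIPTOR
        for area in ("master_config", "config")
    ]
    return [path for path in candidates if path.is_file()]


def back_up(paths, suffix=".bak"):
    """Copy each file next to itself, so manual edits can be reverted."""
    backups = []
    for path in paths:
        target = path.with_name(path.name + suffix)
        shutil.copy2(path, target)  # copy2 also preserves timestamps
        backups.append(target)
    return backups
```

For the collection in the post, a call might look like `back_up(collection_descriptors("/home/esadmin/esdata", "col_37844"))`, assuming the data directory really is /home/esadmin/esdata on that system.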