CQ5.3-5.4 / CRX2.0-2.2
In CQ5.3, CQ5.4, and CRX2.0-2.2, do the following:
- Log in to your server, open a command prompt, and change to the directory crx-quickstart/server/runtime/0/_crx/WEB-INF/lib
- Run this command to extract tika-config.xml from the jackrabbit-core jar (this requires a Java JDK, which provides the jar command):
jar -xvf jackrabbit-core-*.jar org/apache/jackrabbit/core/query/lucene/tika-config.xml
- Edit the extracted file org/apache/jackrabbit/core/query/lucene/tika-config.xml as needed. See the attached tika-config.xml for an example.
- To update the xml file in the jackrabbit-core jar, run this command:
jar -uvf jackrabbit-core-*.jar org/apache/jackrabbit/core/query/lucene/tika-config.xml
- Restart CQ for the changes to take effect.
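The wildcard in the jar commands above only works when exactly one jackrabbit-core jar matches. If the shell expands it to several files, jar treats the first as the archive and the rest as entry names. The following is a minimal sketch of two helper functions that guard against this and keep a copy of the jar before it is changed. The function names and the backup location are assumptions made for illustration; they are not part of CQ or CRX.

```shell
# Hypothetical helpers (names are illustrative, not part of CQ/CRX).
ENTRY=org/apache/jackrabbit/core/query/lucene/tika-config.xml

# Print the single jackrabbit-core jar found in directory $1.
# Returns 1 if zero or more than one file matches the pattern.
find_core_jar() {
  # Reuse the positional parameters to hold the glob matches.
  set -- "$1"/jackrabbit-core-*.jar
  if [ "$#" -ne 1 ] || [ ! -f "$1" ]; then
    echo "expected exactly one jackrabbit-core jar" >&2
    return 1
  fi
  printf '%s\n' "$1"
}

# Copy the jar $1 into backup directory $2 before modifying it.
backup_jar() {
  mkdir -p "$2" && cp -p "$1" "$2/"
}

# Example usage, run from the WEB-INF/lib directory
# ("$HOME/jar-backup" is an assumed backup location):
#   jar_file=$(find_core_jar .) || exit 1
#   backup_jar "$jar_file" "$HOME/jar-backup"
#   jar -xvf "$jar_file" "$ENTRY"
#   (edit the extracted tika-config.xml)
#   jar -uvf "$jar_file" "$ENTRY"
```

Keeping the backup outside WEB-INF/lib avoids leaving a second copy of the jar in the library directory.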
CQ5.5 / CRX2.3
In CQ5.5 and CRX2.3, to update tika-config.xml, do the following:
- First, go to the Felix Web Console at http://<host>:<port>/system/console and find the "Day CRX Sling - CRX Embedded Repository com.day.crx.sling.server" bundle.
- Copy the ID number of the bundle; this is the number on the left side.
- Log in to your server and open a command prompt.
- Change to the directory where the bundle is stored (<id> is the ID number from step 2):
cd crx-quickstart/launchpad/felix/bundle<id>
- Change to the directory where the embedded jars are stored using this command (your versionX.Y folder may have a higher version than 0.0):
cd version0.0/bundle.jar-embedded/
- Run this command to extract the tika-config.xml file from the jar file (your jackrabbit-core jar may have a higher version than 2.4.0):
jar -xvf jackrabbit-core-2.4.0.jar org/apache/jackrabbit/core/query/lucene/tika-config.xml
- Edit the extracted file org/apache/jackrabbit/core/query/lucene/tika-config.xml as needed. See the attached tika-config.xml for an example.
- To update the xml file in the jackrabbit-core jar, run this command:
jar -uvf jackrabbit-core-2.4.0.jar org/apache/jackrabbit/core/query/lucene/tika-config.xml
- Restart CQ for the changes to take effect.
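Because the versionX.Y folder name varies between installations, the directory lookup in the steps above can be scripted. The following is a minimal sketch under two assumptions: the highest-numbered versionX.Y folder is the one in use, and the sort command supports the -V (version sort) option, which is a GNU extension. The function name embedded_jar_dir and the bundle ID 42 are hypothetical.

```shell
# Hypothetical helper (name is illustrative, not part of CQ/CRX).
# Print the bundle.jar-embedded directory inside the highest-numbered
# versionX.Y folder of bundle ID $2, under the quickstart directory $1.
embedded_jar_dir() {
  bundle_dir=$1/launchpad/felix/bundle$2
  newest=$(
    for d in "$bundle_dir"/version*/bundle.jar-embedded; do
      # Skip the unexpanded pattern when nothing matches.
      [ -d "$d" ] && printf '%s\n' "$d"
    done | sort -V | tail -n 1   # sort -V: version sort (assumed available)
  )
  if [ -z "$newest" ]; then
    echo "no bundle.jar-embedded directory under $bundle_dir" >&2
    return 1
  fi
  printf '%s\n' "$newest"
}

# Example usage (42 stands in for the bundle ID copied from the console):
#   cd "$(embedded_jar_dir crx-quickstart 42)" || exit 1
#   ls jackrabbit-core-*.jar
```

Listing the jar afterwards shows the exact jackrabbit-core version to use in the extract and update commands.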
WARNING: If you disable this feature, CQ search will no longer find PDF or Office documents by terms contained in the contents of the files.