WE HAVE SUNSET THIS LISTSERV - Join us at collectionspace@lyrasislists.org
View all threadsHi all,
I've set up an instance of CSpace v4.3 along with Elasticsearch on the same
server. I see that CSpace automatically creates indices in ES based on
tenant binding settings. From what I gather, only newly created or edited
records get added to the ES instance. Is there a way to trigger a complete
indexing of all CSpace records? Or, does anyone have a handy script that
pulls our record data from CSpace and adds it to ES?
Also, is/will there be granular control over what gets indexed? One idea is
to expose ES for report creation (either through Jasper or another
ES-supported service) but I'm cautious about access to sensitive content.
Thanks!
Jesse
Aaagh, I'm jealous, I've been eager to try out the ES support. But I
haven't had time, so I can't help. Let us know what you find!
Ray
On Wed, Mar 30, 2016 at 6:37 PM, Jesse Martinez mjesse@gmail.com wrote:
Hi all,
I've set up an instance of CSpace v4.3 along with Elasticsearch on the
same server. I see that CSpace automatically creates indices in ES based on
tenant binding settings. From what I gather, only newly created or edited
records get added to the ES instance. Is there a way to trigger a complete
indexing of all CSpace records? Or, does anyone have a handy script that
pulls our record data from CSpace and adds it to ES?
Also, is/will there be granular control over what gets indexed? One idea
is to expose ES for report creation (either through Jasper or another
ES-supported service) but I'm cautious about access to sensitive content.
Thanks!
Jesse
Talk mailing list
Talk@lists.collectionspace.org
http://lists.collectionspace.org/mailman/listinfo/talk_lists.collectionspace.org
Jesse,
I'll be adding a "reindex" API to the next major release. I'm adding support to reindex both the Elasticsearch index and the PostgreSQL full text index. The API will look something like this:
For PostgreSQL:
Reindex everything: POST /index/fulltext
Reindex all docs of a certain type: POST /{resource-type}/index/fulltext
Reindex a specific document: POST /{resource-type}/{CSID}/index/fulltext
For ElasticSearch:
Reindex everything: POST /index/esi
Reindex all docs of a certain type: POST /{resource-type}/index/esi
Reindex a specific document: POST /{resource-type}/{CSID}/index/esi
In the meantime, you could try creating a Nuxeo plugin to reindex the existing data into Elasticsearch using this code as an example:
https://github.com/nuxeo/nuxeo/blob/afd1d6c026b3b65c8ce07788bd874b183bc20e4f/nuxeo-features/nuxeo-elasticsearch/nuxeo-elasticsearch-automation/src/main/java/org/nuxeo/ecm/automation/elasticsearch/Indexing.java and it's available since HF25.
To see examples of existing Nuxeo plugins take a look here:
https://github.com/collectionspace/services/tree/master/3rdparty/nuxeo/nuxeo-platform-listener
-Richard
From: Talk talk-bounces@lists.collectionspace.org on behalf of Ray Lee rhlee@berkeley.edu
Sent: Thursday, March 31, 2016 1:19 AM
To: Jesse Martinez
Cc: CollectionSpace Talk List
Subject: Re: [Talk] Elasticsearch indices
Aaagh, I'm jealous, I've been eager to try out the ES support. But I haven't had time, so I can't help. Let us know what you find!
Ray
On Wed, Mar 30, 2016 at 6:37 PM, Jesse Martinez <mjesse@gmail.commailto:mjesse@gmail.com> wrote:
Hi all,
I've set up an instance of CSpace v4.3 along with Elasticsearch on the same server. I see that CSpace automatically creates indices in ES based on tenant binding settings. From what I gather, only newly created or edited records get added to the ES instance. Is there a way to trigger a complete indexing of all CSpace records? Or, does anyone have a handy script that pulls our record data from CSpace and adds it to ES?
Also, is/will there be granular control over what gets indexed? One idea is to expose ES for report creation (either through Jasper or another ES-supported service) but I'm cautious about access to sensitive content.
Thanks!
Jesse
Talk mailing list
Talk@lists.collectionspace.orgmailto:Talk@lists.collectionspace.org
http://lists.collectionspace.org/mailman/listinfo/talk_lists.collectionspace.org
Did anything come of this? Does the proposed "reindex API" work?
Is anyone using the ElasticSearch capabilities yet?
Just wondering,
John
On Fri, Apr 1, 2016 at 11:14 AM, Richard Millet richard.millet@lyrasis.org
wrote:
Jesse,
I'll be adding a "reindex" API to the next major release. I'm adding
support to reindex both the Elasticsearch index and the PostgreSQL full
text index. The API will look something like this:
For PostgreSQL:
Reindex everything: POST /index/fulltext
Reindex all docs of a certain type: POST /{resource-type}/index/fulltext
Reindex a specific document: POST /{resource-type}/{CSID}/index/fulltext
For ElasticSearch:
Reindex everything: POST /index/esi
Reindex all docs of a certain type: POST /{resource-type}/index/esi
Reindex a specific document: POST /{resource-type}/{CSID}/index/esi
In the meantime, you could try creating a Nuxeo plugin to reindex the
existing data into Elasticsearch using this code as an example:
https://github.com/nuxeo/nuxeo/blob/afd1d6c026b3b65c8ce07788
bd874b183bc20e4f/nuxeo-features/nuxeo-elasticsearch/nuxeo-elasticsearch-
automation/src/main/java/org/nuxeo/ecm/automation/
elasticsearch/Indexing.java and it's available since HF25.
To see examples of existing Nuxeo plugins take a look here:
https://github.com/collectionspace/services/tree/master/
3rdparty/nuxeo/nuxeo-platform-listener
-Richard
From: Talk talk-bounces@lists.collectionspace.org on behalf of Ray
Lee rhlee@berkeley.edu
Sent: Thursday, March 31, 2016 1:19 AM
To: Jesse Martinez
Cc: CollectionSpace Talk List
Subject: Re: [Talk] Elasticsearch indices
Aaagh, I'm jealous, I've been eager to try out the ES support. But I
haven't had time, so I can't help. Let us know what you find!
Ray
On Wed, Mar 30, 2016 at 6:37 PM, Jesse Martinez mjesse@gmail.com wrote:
Hi all,
I've set up an instance of CSpace v4.3 along with Elasticsearch on the
same server. I see that CSpace automatically creates indices in ES based on
tenant binding settings. From what I gather, only newly created or edited
records get added to the ES instance. Is there a way to trigger a complete
indexing of all CSpace records? Or, does anyone have a handy script that
pulls our record data from CSpace and adds it to ES?
Also, is/will there be granular control over what gets indexed? One idea
is to expose ES for report creation (either through Jasper or another
ES-supported service) but I'm cautious about access to sensitive content.
Thanks!
Jesse