Our primary use case is to enable content search for the enterprise.
Solr is an open-source search platform from the Apache Lucene project designed for scalability and providing high-performance search capabilities. It is widely used for its flexibility and scalability, making it ideal for various search-driven applications.
| Product | Mindshare (%) |
|---|---|
| Solr | 5.2% |
| Elastic Search | 17.2% |
| Xapien | 12.0% |
| Other | 65.6% |
Solr is recognized for its capacity to handle large amounts of data and complex queries, offering a distributed approach to search. It enables users to perform full-text search, hit highlighting, faceted search, real-time indexing, and dynamic clustering. Solr's robust administration interfaces and extensive plugin architecture allow tailored configurations suited to specific applications. Its compatibility with Hadoop environments enhances big data handling, facilitating effective, high-volume search processing.
What are the most important features of Solr?Solr is prominently used in e-commerce, where rapid and precise search capabilities are critical for customer satisfaction. In publishing, it powers content discovery and enhances user engagement through personalized recommendations. Finance sectors utilize Solr for analyzing large datasets, optimizing search for records and transactions. Its versatility and ability to integrate seamlessly into existing IT infrastructures make it a preferred choice across these markets.
| Author info | Rating | Review Summary |
|---|---|---|
| Senior Search Engineer at a financial services firm with 51-200 employees | 4.0 | This highly stable and scalable solution significantly improved our enterprise content search with its valuable natural language capabilities, leading to 80% user adoption. While setup was complex, I recommend it, despite needing better query performance and backend improvements. |
| Data Scientist at a comms service provider with 10,001+ employees | 3.5 | I value Solr's efficient indexing, which saves significant effort. However, I find its grammar difficult, documentation lacking, and it requires more complex operations to match MongoDB's capabilities and stability. |
| Senior Software Engineer, Search at a tech services company with 501-1,000 employees | 4.0 | I found it improved our search performance and user retention with valuable features like sharding and faceting. However, SolrCloud stability, indexing speed, and scalability with increased sharding were significant issues I experienced. |
| VP of Product at a legal firm with 51-200 employees | 4.0 | I found Solr fast and flexible for indexing 600k documents with powerful search. While it improved our capabilities, its high memory usage is a con. I'd now consider search-as-a-service like Elasticsearch due to Solr's overhead. |
Our primary use case is to enable content search for the enterprise.
Our users are now able to find ETFs and their documents that are scattered across different repositories. It is proven because we have a consistent 80% of users who are using the application every month.
The most valuable feature is the ability to perform a natural language search. That is helpful because users tend to use natural language when doing research.
The performance for this solution, in terms of queries, could be improved.
Improvements with the backend and capability could be made so that it is easier for the engineers to maintain it.
Three years.
This solution is very stable. It has very few operational hiccups.
It is a highly scalable solution. It is a distributed system that can scale infinitely, horizontally.
Everybody uses the system. Eighty percent of the users in the whole enterprise, from management to ground level staff, use it at different times of the week, and at different times of the year. We do not have plans to increase our usage at this point, as it is at a comfortable level.
Technical support is pretty good. There are also internet-based community groups who are knowledgeable with respect to this solution.
We did use a previous solution, but we switched in order to cut costs and have more flexibility over the solution.
The initial setup is complex because this is a distributed system, and you have to make sure that every individual node is aware of every other node in existence. This search engine has a large capacity, so you need to make sure that there is enough buffer space.
We took one month to deploy and perform a fresh setup. Our strategy was to start with a local data center, before venturing into cross data center replicas.
A staff size of two to four people is suitable for deploying and maintaining the solution, depending upon the scale. They would set up the solution and put monitoring in place for the indexing jobs, as well as design the schema so that the data can feed well.
We used a service provider to assist with the migration and setup.
The only costs in addition to the standard licensing fees are related to the hardware, depending on whether it is cloud-based, or on-premise.
My advice to anyone interested in implementing this solution is to make sure that somebody who is experienced with this product is on the team. It is best to get mistakes out of the way early.
This is an infinitely scalable product with state-of-the-art technology, and the value of Natural Language Search is tremendous.
I would rate this solution an eight out of ten.
We use the solution in the telecommunications industry.
One of the best aspects of the solution is the indexing. It's already indexed to all the fields in the category. We don't need to spend so much extra effort to do the indexing. It's great.
The solution's grammar and syntax should be easier.
It does take a little bit of effort to use and understand the solution. It would help us a lot if the solution offered up more documentation or tutorials to help with training or troubleshooting.
MongoDB can realize more complex operations than Solr can. Solr should add some more complex operations to the database to at least bring it up to MondoDB's level of functionality. It would make it more competitive.
There might be some compatibility issues between the data types within Solr. This needs to be improved.
Solr has a schema that we have to load the schema as an HTML, or SML file. This usually needs to be done by our engineers. It should be easier to do without needing too much technical background.
We've been using the solution for more than one and a half years.
The stability is pretty good. however, I wouldn't say that it is as stable as MongoDB. Mongo is much more reliable.
I'm unsure about the scalability of the solution. We have ten people in our team, and out of those, four people actually work on it. They're mostly data engineers. I'm not sure if we plan to increase usage or not in the future.
I haven't really dealt with technical support. I've mostly just gone through online tutorials to get help. I find those lacking, however. They need work so that they would actually be more useful for people.
We also use MongoDB. Solr is better than MongoDB because the Solr already indexes to all in their categorical field. We find that Solr also sometimes offers better performance. However, we do find that MongoDB is better for more complex operations.
I'm unsure about the initial setup. Our data engineer handled the implementation. I didn't participate in the process, so I can't speak to how straightforward or complex it was.
Our data engineer handled the process for us. It was handled in house.
We're just customers. We don't have a business relationship with the company.
I'd rate the solution seven out of ten.
It has improved our search ranking, relevancy, search performance, and user retention.
SolrCloud stability, indexing and commit speed, and real-time indexing need improvement.
With increased sharding, performance degrades. Merger, when present, is a bottle-neck. Peer-to-peer sync has issues in SolrCloud when index is incrementally updated.
Online content is available, but release cycle is relatively slow. Since it's open-source, there is no dedicated support to reach out to when there is a P1 issue.
No previous solution.
Complex in making system scalable and performant enough.
Elasticsearch.
One of the best search platforms available today.
We use Solr to index over 600k documents. It's very fast and has been flexible to use. The speed of indexing individual documents has been great. The most valuable feature has been the powerful search syntax that we use to find documents with specific attributes.
We now have a much more powerful way to search our set of documents and the open source price is right.
Memory utilization could be better but it is an industrial strength tool so some overhead is to be expected.
4 years
N/A - use books and online forums for support.
Doing this again I would look at search as a service solution. The overhead cost of running Solr on a cloud server and the additional storage should be evaluated against something like elasticsearch. Rackspace has not had a search service, but if they did we would consider it.