Gigablast

A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD.

Cost / License

  • Free
  • Open Source

Application type

Alerts

  • Discontinued

Platforms

  • Online
  • Self-Hosted
Discontinued

It's dead.

No reviews
38 likes
3 comments
0 news articles

Features

  1.  Search operators
  2.  Indexed search
  3.  Distributed

Comments and Reviews

   
TBayAreaPat
0

Seems fast, but if you search under the News category, results aren't dated.

Mateo Castro
0

It loads fast. Not as immediate as Google, but its average loading time is 3 seconds (which is alright). Its results are narrow, though. For instance, none of the results for 'webcatalog app' took me to the site where I can make the download.

maxrempel
0

Just put your search in quotation marks and you can find things by exact phrase match. Google and Bing dropped this service, so here is a great alternative. It works!

dimlakgorkehgz

I'm sure Google did not.

Featured in Lists

All different search sites.

List by Ashley Slack with 32 apps, updated

A list with 9 apps by infinitysearch without a description.

List by Infinity Search with 9 apps, updated

A list with 15 apps by 6feriolimarco without a description.

List by Francisco Ferioli Marco with 15 apps, updated

What is Gigablast?

Gigablast is a powerful, new, open-source search engine that does real-time indexing!

Features

  • Scalable to thousands of servers. Has scaled to over 12 billion web pages on over 200 servers.
  • A dual quad core, with 32GB RAM, and two 160GB Intel SSDs, running 8 Gigablast instances, can do about 8 qps (queries per second) on an index of 10 million pages. Drives will be close to maximum storage capacity. Doubling index size will more or less halve qps rate. (Performance metrics can be made about ten times faster but I have not got around to it yet. Drive space usage will probably remain about the same because it is already pretty efficient.)
  • 1 million web pages requires 28.6GB of drive space. That includes the index, meta information and the compressed HTML of all the web pages.
  • Spider rate is around 1 page per second per core. So a dual quad core can spider and index 8 pages per second which is 691,200 pages per day.
  • 4GB of RAM required per Gigablast instance. (instance = process)
  • Live demo at http://www.gigablast.com/
  • Written in C/C++ for optimal performance. Over 500,000 lines of C/C++. 100% custom.
  • A single binary. The web server, database and everything else is all contained in this source code in a highly efficient manner. Makes administration and troubleshooting easier.
  • Reliable. Has been tested in live production since 2002 on billions of queries on an index of over 12 billion unique web pages, 24 billion mirrored. Super fast and efficient.
  • One of a small handful of search engines that have hit such big numbers. The only open source search engine that has.
  • Supports all languages. Can give results in specified languages a boost over others at query time. Uses UTF-8 representation internally.
  • Track record. Has been used by many clients. Has been successfully used in distributed enterprise software.
  • Cached web pages with query term highlighting.
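
The hardware figures quoted above can be sanity-checked with a little arithmetic. The following is a rough sketch in Python, not part of Gigablast itself: the function names are hypothetical, the constants are taken directly from the feature description, and the query-rate estimate assumes the "doubling index size halves qps" remark holds as a simple inverse proportion at every index size.

```python
# Back-of-the-envelope capacity estimates using only the figures quoted
# in the feature description. All function names here are illustrative.

SECONDS_PER_DAY = 24 * 60 * 60

GB_PER_MILLION_PAGES = 28.6      # quoted: 1 million pages requires 28.6GB
PAGES_PER_SECOND_PER_CORE = 1    # quoted: spider rate per core
BASELINE_PAGES = 10_000_000      # quoted: benchmark index size
BASELINE_QPS = 8.0               # quoted: queries per second at that size


def pages_per_day(cores: int) -> int:
    """Pages spidered and indexed per day for a given core count."""
    return cores * PAGES_PER_SECOND_PER_CORE * SECONDS_PER_DAY


def disk_gb(pages: int) -> float:
    """Drive space in GB for an index of `pages` web pages."""
    return pages / 1_000_000 * GB_PER_MILLION_PAGES


def estimated_qps(pages: int) -> float:
    """Query rate on the benchmark machine.

    Assumption: qps is inversely proportional to index size, extrapolated
    from the statement that doubling the index roughly halves qps.
    """
    return BASELINE_QPS * BASELINE_PAGES / pages


def days_to_crawl(pages: int, cores: int) -> float:
    """Days needed to spider `pages` pages at the quoted rate."""
    return pages / pages_per_day(cores)


if __name__ == "__main__":
    print("pages/day on 8 cores:", pages_per_day(8))
    print("GB for 10M pages:", disk_gb(10_000_000))
    print("estimated qps at 20M pages:", estimated_qps(20_000_000))
    print("days to crawl 10M pages:", round(days_to_crawl(10_000_000, 8), 1))
```

With these numbers, 8 cores give 691,200 pages per day, and a 10 million page index needs about 286GB, which is why the two 160GB SSDs (320GB in total) are described as close to full. Filling that index at the quoted spider rate would take roughly two weeks.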

Official Links

Gigablast information

  • Developed by

    Matt Wells (US)
  • Licensing

    Open Source (Apache-2.0) and Free product.
  • Written in

    C/C++
  • Alternatives

    86 alternatives listed
  • Supported Languages

    • English

AlternativeTo Category

Online Services

GitHub repository

  • 1,597 Stars
  • 457 Forks
  • 95 Open Issues

Our users have written 3 comments and reviews about Gigablast, and it has received 38 likes.

Gigablast was added to AlternativeTo by seatsea.