Your own search engine w/ spiders?

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • qxh
    Senior Member
    • Jan 2003
    • 1865
    • 3.0.3

    Your own search engine w/ spiders?

    How would you go about starting you own "mini" search engine with spiders? Would you need SSH to actually send the spiders out or would that be done via a script? What database backend would you use? Do you even need a database backend?

    So many questions.. Does anyone have any kinda tutorials or something for it, or anymore information?

    Thanks,
    Scott
  • IDN
    Senior Member
    • Apr 2002
    • 4030
    • 3.5.x

    #2
    I don't think you're a moron
    Running vB since 4-14-2002

    Comment

    • merk
      Senior Member
      • Jul 2001
      • 4149

      #3
      Why would you do such a thing?

      Have you read about what google actually runs to achieve what they do?

      Comment

      • Game Wizards
        Senior Member
        • Jan 2003
        • 1865
        • 3.0.3

        #4
        Originally posted by IDN
        I don't think you're a moron
        LOL

        It's just a mini search engine which I need to develop for certain small sites..though it needs a spider

        Comment

        • wandrer
          Senior Member
          • Apr 2000
          • 267

          #5
          Originally posted by Game Wizards
          How would you go about starting you own "mini" search engine with spiders?
          Регистрируйтесь бесплатно в самой большой программе лояльности в стране, призы за покупки активные промокоды бесплатные книги Аптека.ру ЛитРес Здравсити Максидом Улыбка Радуги Яндекс Маркет Алиэкспрес Сбермегамаркет Яндекс Браузер


          mnoGoSearch (formerly known as UdmSearch) is a full-featured web search engine software for intranet and internet servers. mnoGoSearch for UNIX is a free software covered by the GNU General Public License and mnoGoSearch for Windows is a commercial search software version.

          mnoGoSearch software has a number of unique features, which makes it appropriate for a wide range of applications from search within your site to specialized search systems such as cooking recipes or newspaper searches, ftp archive search, MP3 search, news articles search or even national-wide portal search engine.

          Comment

          • Game Wizards
            Senior Member
            • Jan 2003
            • 1865
            • 3.0.3

            #6
            Thanks wandrer, I'll be looking into it

            Comment

            • merk
              Senior Member
              • Jul 2001
              • 4149

              #7
              I still think you need a Pigeon Cluster!

              Comment

              • Beorn
                Senior Member
                • May 2002
                • 451

                #8
                Read lynx(1) and look for -crawl and -traversal.

                Use that to spider the whole site. After that, loop through each file you downloaded. Create a search index like vB's search.

                Comment

                • merk
                  Senior Member
                  • Jul 2001
                  • 4149

                  #9
                  Totally offtopic Beorn, but shouldnt the fury line read

                  dumbasspeoplesmoney--; ??

                  smartasslawyermoney++;, too

                  Comment

                  • Beorn
                    Senior Member
                    • May 2002
                    • 451

                    #10
                    Originally posted by merk
                    Totally offtopic Beorn, but shouldnt the fury line read

                    dumbasspeoplesmoney--; ??

                    smartasslawyermoney++;, too
                    Not really, because the dumbasspeoplesmoney keeps growing as long as the court system is screwed up....like the woman who sued 7-11 because her coffee was too hot....she put it between her legs, and while driving she hit a bump, it spilled all over her and burned her. Supposedly, this is 7-11's fault.

                    Comment

                    • ShiningArcanine
                      Senior Member
                      • Feb 2003
                      • 2482
                      • 3.0.3

                      #11
                      Learn ASP.NET and then start coding. It is ideal for large applications like this. That is, if you want to code a search engine for the entire internet.

                      Comment

                      • Game Wizards
                        Senior Member
                        • Jan 2003
                        • 1865
                        • 3.0.3

                        #12
                        I don't want to use ASP, I would like to ideally use PHP and mySQL database..CGI/Perl maybe!

                        Comment

                        • Icheb
                          Senior Member
                          • Nov 2002
                          • 1291

                          #13
                          Originally posted by Game Wizards
                          I don't want to use ASP, I would like to ideally use PHP and mySQL database..CGI/Perl maybe!
                          *wonders if he knows what CGI/Perl really is*

                          Ok, never mind.

                          "Do you even need a database backend?"

                          Seriously, if you don't know that, you shouldn't start coding a search engine.

                          Comment

                          • merk
                            Senior Member
                            • Jul 2001
                            • 4149

                            #14
                            Originally posted by Beorn
                            Not really, because the dumbasspeoplesmoney keeps growing as long as the court system is screwed up....like the woman who sued 7-11 because her coffee was too hot....she put it between her legs, and while driving she hit a bump, it spilled all over her and burned her. Supposedly, this is 7-11's fault.
                            True i guess, though, you could also consider that the lawyers money grows as dumb people sue.

                            Lucky our court systems are not that bad around here

                            Comment

                            • 0ptima
                              Senior Member
                              • Jan 2002
                              • 1557

                              #15

                              Comment

                              widgetinstance 262 (Related Topics) skipped due to lack of content & hide_module_if_empty option.
                              Working...