What is the best way (in PHP) to get the page title and meta tag contents of potentially millions of remote URLs in real time?
Also, is this feasible to accomplish using a single shared server?
So far I'm looking into 4 possibilities. (I'm also using CodeIgniter.)
You can't do millions in real time on a shared server. You'll very likely get shut down for using too much CPU. But if you are using PHP, your best bet would be to use multi-curl (the curl_multi_* functions), which runs many HTTP transfers concurrently instead of one after another. See a very similar question, which contains a code sample:
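Separately from the linked sample, here is a minimal sketch of what a multi-curl batch could look like. The function name `fetch_all`, the timeout values, and the redirect limit are assumptions made for illustration, not something from the question or the linked thread.

```php
<?php
/**
 * Fetch a batch of URLs concurrently with curl_multi.
 * Hypothetical helper: returns [url => body], with null for failed transfers.
 */
function fetch_all(array $urls, int $timeout = 10): array
{
    $urls = array_values(array_unique($urls));
    $mh = curl_multi_init();
    $handles = [];

    foreach ($urls as $url) {
        $ch = curl_init($url);
        curl_setopt_array($ch, [
            CURLOPT_RETURNTRANSFER => true,   // keep the body instead of printing it
            CURLOPT_FOLLOWLOCATION => true,
            CURLOPT_MAXREDIRS      => 5,      // assumed limit
            CURLOPT_CONNECTTIMEOUT => 5,      // assumed limit, in seconds
            CURLOPT_TIMEOUT        => $timeout,
        ]);
        curl_multi_add_handle($mh, $ch);
        $handles[$url] = $ch;
    }

    // Drive all transfers until none are still running.
    do {
        $status = curl_multi_exec($mh, $running);
        if ($running > 0 && curl_multi_select($mh, 1.0) === -1) {
            usleep(1000); // select failed; back off briefly instead of spinning
        }
    } while ($running > 0 && $status === CURLM_OK);

    // Read per-transfer results; only successful transfers get a body.
    $results = array_fill_keys($urls, null);
    while (($info = curl_multi_info_read($mh)) !== false) {
        $url = array_search($info['handle'], $handles, true);
        if ($url !== false && $info['result'] === CURLE_OK) {
            $results[$url] = curl_multi_getcontent($info['handle']);
        }
    }

    foreach ($handles as $ch) {
        curl_multi_remove_handle($mh, $ch);
    }
    curl_multi_close($mh);

    return $results;
}
```

For a large URL list you would probably call this on small chunks (for example via `array_chunk($urls, 20)`, where 20 is an arbitrary figure) rather than opening every connection at once, which also helps keep CPU and memory use within what a shared host tolerates.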
It depends on your purpose. In any case you should use an asynchronous approach. In PHP you can try curl with async sockets or the pcntl extension (from CGI mode). Or you can use the now-popular node.js too (but it's not PHP at all :) )
You can try PHP Simple HTML DOM Parser. With this DOM parser you load the whole page content and then pull the title and meta tags out of the head.
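To illustrate the parsing step, here is a rough sketch that uses PHP's built-in DOMDocument instead of PHP Simple HTML DOM Parser, so that it needs no third-party library. The function name `extract_title_and_meta` and the shape of the returned array are assumptions made for this example.

```php
<?php
/**
 * Extract the <title> text and meta tags from an HTML string.
 * Hypothetical helper: returns ['title' => ?string, 'meta' => [key => content]].
 */
function extract_title_and_meta(string $html): array
{
    $result = ['title' => null, 'meta' => []];
    if (trim($html) === '') {
        return $result; // loadHTML() rejects empty input
    }

    $doc = new DOMDocument();
    // Real-world pages are rarely valid HTML; collect parser warnings silently.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $titles = $doc->getElementsByTagName('title');
    if ($titles->length > 0) {
        $result['title'] = trim($titles->item(0)->textContent);
    }

    foreach ($doc->getElementsByTagName('meta') as $tag) {
        // Standard tags use name="...", Open Graph tags use property="...".
        $key = $tag->getAttribute('name') ?: $tag->getAttribute('property');
        if ($key !== '') {
            $result['meta'][strtolower($key)] = $tag->getAttribute('content');
        }
    }

    return $result;
}
```

Note that `loadHTML()` guesses the character encoding from the document itself, so pages that do not declare a charset may need converting before parsing. Since only the head is needed, it may also be worth cutting the downloaded body off after `</head>` before parsing to save memory.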