Search Results

Search found 14958 results on 599 pages for 'people'.

Page 238/599 | < Previous Page | 234 235 236 237 238 239 240 241 242 243 244 245  | Next Page >

  • Detecting 'stealth' web-crawlers

    - by Jacco
    What options are there to detect web crawlers that do not want to be detected? (I know that listing detection techniques will allow the smart stealth-crawler programmer to make a better spider, but I do not think that we will ever be able to block smart stealth crawlers anyway, only the ones that make mistakes.)

    I'm not talking about the nice crawlers such as Googlebot and Yahoo! Slurp. I consider a bot nice if it:
      * identifies itself as a bot in the user agent string
      * reads robots.txt (and obeys it)

    I'm talking about the bad crawlers, hiding behind common user agents, using my bandwidth and never giving me anything in return.

    There are some trapdoors that can be constructed. Updated list (thanks Chris, gs):
      * Add a directory that is only listed (marked as disallow) in robots.txt.
      * Add invisible links (possibly marked as rel="nofollow"?):
          * style="display: none;" on the link or its parent container
          * placed underneath another element with a higher z-index
      * Detect who doesn't understand CaPiTaLiSaTioN.
      * Detect who tries to post replies but always fails the Captcha.
      * Detect GET requests to POST-only resources.
      * Detect the interval between requests.
      * Detect the order of pages requested.
      * Detect who (consistently) requests https resources over http.
      * Detect who does not request image files (this, in combination with a list of user agents of known image-capable browsers, works surprisingly well).

    Some traps would be triggered by both 'good' and 'bad' bots. You could combine those with a whitelist:
      1. It triggers a trap.
      2. It requests robots.txt?
      3. It does not trigger another trap because it obeyed robots.txt.

    One other important thing here: please consider blind people using screen readers. Give people a way to contact you, or let them solve a (non-image) Captcha to continue browsing.

    What methods are there to automatically detect web crawlers trying to mask themselves as normal human visitors?

    Update: The question is not "How do I catch every crawler?" The question is "How can I maximize the chance of detecting a crawler?"

    Some spiders are really good, and actually parse and understand HTML, XHTML, CSS, JavaScript, VBScript, etc. I have no illusions: I won't be able to beat them. You would, however, be surprised how stupid some crawlers are. The best example of stupidity (in my opinion) is casting all URLs to lower case before requesting them. And then there is a whole bunch of crawlers that are just 'not good enough' to avoid the various trapdoors.
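
A few of the traps listed in the question (the robots.txt-only directory, GET on POST-only resources, lower-cased URLs) can be combined into a per-visitor score. The following is only a rough sketch of that idea in Python: the paths, the `VisitorRecord` class, and the scoring rule are all made up for illustration and are not taken from the question.

```python
from dataclasses import dataclass, field

# Hypothetical site layout, for illustration only.
HONEYPOT_PREFIX = "/trap/"             # directory listed only as Disallow in robots.txt
POST_ONLY_PATHS = {"/comment/submit"}  # resources that should never be fetched with GET
KNOWN_PATHS = {"/About/Team.html", "/Products/Index.html"}  # mixed-case URLs the site serves


@dataclass
class VisitorRecord:
    fetched_robots_txt: bool = False
    tripped: set = field(default_factory=set)


def record_request(visitor, method, path):
    """Update one visitor's record with any trap this request trips."""
    if path == "/robots.txt":
        visitor.fetched_robots_txt = True
        return
    if path.startswith(HONEYPOT_PREFIX):
        visitor.tripped.add("honeypot")
    if method == "GET" and path in POST_ONLY_PATHS:
        visitor.tripped.add("get-on-post-only")
    # Lower-casing trap: the path is not served as-is, but it is the
    # lower-cased form of a URL that does exist.
    if path not in KNOWN_PATHS and path in {p.lower() for p in KNOWN_PATHS}:
        visitor.tripped.add("lowercased-url")


def suspicion_score(visitor):
    """One point per distinct trap, plus one for ignoring robots.txt after reading it."""
    score = len(visitor.tripped)
    if "honeypot" in visitor.tripped and visitor.fetched_robots_txt:
        score += 1
    return score
```

A visitor that reads robots.txt and then requests something under the honeypot directory scores 2 here, while a visitor that only requests real, correctly cased URLs scores 0. Where to put the blocking threshold is a judgment call this sketch does not make.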


  • Facebook Connect Statistics

    - by thekevinscott
    All, A client is asking for Facebook Connect statistical data. Specifically, how many people have shared a link to their wall. I have a Facebook app set up, but I am having trouble interpreting the statistical data. Do Facebook apps collect this data, or can anyone think of any way of gleaning this data for the past month, from server logs or something? For instance, can I look at the logs for xd_receiver.htm and see usage patterns from that?


  • Python module that implements ftps

    - by David Turner
    Hi People, I was wondering if anybody could point me towards a free FTPS module for Python. I am a complete newbie to Python, but this is something I need for a work project. I need an FTPS client to connect to a third-party FTPS server. Thanks, David.
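
For what it's worth, Python's standard library ships `ftplib.FTP_TLS`, which speaks explicit FTPS (the client connects on the normal FTP port and then upgrades to TLS). A minimal sketch follows, assuming the third-party server uses explicit FTPS; the function name and its parameters are placeholders, and a server that only offers implicit FTPS (usually port 990) would need a different approach.

```python
from ftplib import FTP_TLS


def list_remote_directory(host, user, password, directory="."):
    """Return the file names in `directory` on an explicit-FTPS server."""
    with FTP_TLS(host) as ftps:      # connects to the standard FTP port
        ftps.login(user, password)   # secures the control channel before sending credentials
        ftps.prot_p()                # switch the data channel to TLS as well
        ftps.cwd(directory)
        return ftps.nlst()
```

Calling `list_remote_directory("ftp.example.com", "user", "secret")` would return a list of names, given a reachable server; the host and credentials here are of course invented.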


  • Create Audio file on iPhone/iPad from many other audio files (mixer)

    - by Brian
    I am trying to create something similar to a piano app on the iPhone. When people tap a key, it plays a piano note. Basically, there will be only 7 notes (C) at the moment. Each note is a .caf file and its length is 5 seconds. I do not know if there is any way to save the song the user played and export it to mp3/caf format. AVAudioRecorder seems to record only from the microphone input. Many thanks


  • mojoPortal OR Umbraco?

    - by leen3o
    I have been looking around for free/open-source ASP.NET CMS / portal systems for a while now, and have sieved it down to two different ones: Umbraco - http://umbraco.org and mojoPortal - http://www.mojoportal.com. Both look excellent and have different appealing features, but I am looking for people who have used both: which one did you go with, and why?


  • Facebook Graph API: user gender

    - by Mark
    The old Facebook API provided the user sex/gender as part of the default user data. Apparently the new Graph API does not provide that information, even though the documentation says that it does. I've heard people say that you need to request special permissions to get it and other pieces of data, but I have not been successful in getting it to work. Does anyone have an example, using the Facebook Graph API, of how to get the user's gender and/or location (city/state/country/whatever)?


  • php connection pooling mysql

    - by coool
    Hi, I am planning to use MySQL. Is there a connection pooling extension available, or what is the normal practice for connections? Is this the one used everywhere: mysqli_connect("localhost", "xxx", "xxx", "test"); Do people use just the normal mysql_connect or pconnect? How much better is pconnect, and what settings should I use for it? Thanks


  • Looking for more details about "Group varint encoding/decoding" presented in Jeff's slides

    - by Mickey Shine
    I noticed that in Jeff's slides "Challenges in Building Large-Scale Information Retrieval Systems", which can also be downloaded here: http://research.google.com/people/jeff/WSDM09-keynote.pdf, an integer compression method called "group varint encoding" was mentioned. It was said to be much faster (about 2X) than the 7-bits-per-byte integer encoding. I am very interested in this and am looking for an implementation, or any more details that could help me implement it myself. I am not a pro and am new to this, so any help is welcome!
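
As far as the general idea goes, group varint packs four integers at a time: one tag byte holds four 2-bit fields, each giving the byte length (1 to 4) of one value, and the values' bytes follow, so a group takes 5 to 17 bytes. Here is a small Python sketch of that layout. The bit order inside the tag and the little-endian byte order are assumptions made for this example and may not match the slides, and a plain Python loop like this shows the format only, not the speed.

```python
def encode_group(values):
    """Encode exactly four unsigned 32-bit integers as 1 tag byte + 4..16 data bytes."""
    if len(values) != 4:
        raise ValueError("group varint encodes four values at a time")
    tag = 0
    body = bytearray()
    for i, v in enumerate(values):
        if not 0 <= v < 2**32:
            raise ValueError("value out of 32-bit range")
        n = max(1, (v.bit_length() + 7) // 8)  # bytes needed for this value: 1..4
        tag |= (n - 1) << (2 * i)              # store (length - 1) in two bits of the tag
        body += v.to_bytes(n, "little")
    return bytes([tag]) + bytes(body)


def decode_group(data, offset=0):
    """Decode one group starting at `offset`; return (four values, next offset)."""
    tag = data[offset]
    pos = offset + 1
    values = []
    for i in range(4):
        n = ((tag >> (2 * i)) & 0b11) + 1      # recover this value's byte length
        values.append(int.from_bytes(data[pos:pos + n], "little"))
        pos += n
    return values, pos
```

For example, `encode_group([1, 15, 511, 131071])` needs 1 + 1 + 2 + 3 data bytes plus the tag, so 8 bytes in total, and `decode_group` on the result gives the four values back. A list whose length is not a multiple of four would need padding or a separate count, which is left out here.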


  • Redirect help fixing broken backslash link

    - by zac
    I just accidentally launched a campaign that is driving traffic to index.html/ (the trailing slash is breaking it). I am not sure I can access the .htaccess file, and was wondering: if I were to create a page so that people directed to this path would see something, what would the name of that page be? Since there is no name, just an extra slash, I am not sure how to handle this.


  • ConfigurationManager.OpenExeConfiguration() vs XML file

    - by Vince
    Hi, Could someone tell me the advantages of using the ConfigurationManager class, which loads a config file for manipulation, versus an XML file with a class you build to read it yourself? Recently, I built a class which inherits from ConfigurationSection in order to manipulate a custom section within app.config. This was quite a bit of work compared to just opening and reading an XML file. Some people choose the first approach, others choose the second. What's good practice?


  • Direct2D fallback to GDI in XP

    - by gilbertc
    I'd like to use Direct2D for my .Net application using the Windows Code pack. Since some of my users are using XP, I need to provide a GDI+ fallback. I wonder how people usually do this kind of fallback. Do they abstract/interface all the necessary graphics feature into some custom generic library? If so, is there any open-source library for direct2d/GDI? Thanks, Gil.


  • Linux Kernel Threads - scheduler

    - by Kartlee
    Hi People, Is the Linux kernel scheduler a part of the init process? My understanding is that it is part of the kernel threads, which are managed internally and are not visible to the user through either top or ps. Please correct my understanding. Is it possible to view standard kernel threads through a kernel debugger to see how much CPU they use? -Kartlee


  • Red5 vs BlazeDS

    - by dvm
    Which technology is better (robust, stable, fast) for a rich web app with Flash movies, chat, etc.? Somebody said that Red5 will go down if more than 50 users try to pull a video stream from it. Are there people who use Java on the server side and Flash on the client side?


  • clickonce - what is a good open source alternative to clickonce? (DDay.Update)?

    - by Greg
    Hi, What is a good open source alternative to ClickOnce? One that is most popular and under active development, I guess? DDay.Update perhaps? Is this the main one? Thanks. PS. I've come up with a few from searching, but would appreciate any feedback from people who have reviewed these and have an idea of which is most popular/worth looking into first. .NET Application Updater Component - http://windowsclient.net/articles/appupdater.aspx nlaunch - http://code.google.com/p/nlaunch/ dotnetautoupdate - http://code.google.com/p/dotnetautoupdate/


  • Parameter parsing with Java Restlet

    - by Leaf
    I'm trying to make a calculator web service that has more than 10 parameters. Is there any function in Java Restlet or in Java itself that allows you to see the parameter ids? E.g. http://mywebsite.com/calculator?id1=value1&id2=value2&id3=value3 I'm trying to access the values of id1, id2, and id3 so that I don't have to require people using the web service to input all parameters (some of them are likely to be 0).
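
The underlying idea is independent of the framework: read the query string into a name-to-value map, then fall back to a default for any parameter the caller left out. The sketch below shows that in Python with the standard library rather than in Restlet; `read_parameters` and its arguments are names invented for this example, and the values are assumed to be numeric.

```python
from urllib.parse import parse_qs, urlsplit


def read_parameters(url, expected, default=0.0):
    """Map each expected parameter name to its numeric value, or to `default` if absent."""
    query = parse_qs(urlsplit(url).query)  # e.g. {"id1": ["2"], "id3": ["4.5"]}
    return {
        name: float(query[name][0]) if name in query else default
        for name in expected
    }
```

With this, `read_parameters("http://mywebsite.com/calculator?id1=2&id3=4.5", ["id1", "id2", "id3"])` fills in 0.0 for the missing id2. Note that `parse_qs` drops parameters with blank values by default, so `id2=` is treated the same as a missing id2.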


  • Test data generators / quickest route to generating solid, non-repetitive, but not-real database sam

    - by Jamo
    I need to build a quick feasibility test / proof-of-concept of a remote database for a client, that will be populated with mostly-typical Company and People data (names, addresses, etc); 150K records or so. The sample databases mentioned here were helpful: http://stackoverflow.com/questions/57068/good-databases-with-sample-data ...but, I'd like to be able to generate sample data like this easily on less-typical datasets as well. Anyone have any recommendations for off-the-shelf (or off-the-web) solutions?
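
If no off-the-shelf tool fits a less-typical dataset, a short script can go a long way. Below is a hedged Python sketch of a seeded generator for people-style records; the field names and the small word lists are invented for illustration, and real use would want much larger lists or fields that match the actual schema.

```python
import random

# Tiny illustrative word lists; a real data set would need far more variety.
FIRST_NAMES = ["Ada", "Ben", "Chloe", "Dev", "Elena", "Farid", "Grace", "Hugo"]
LAST_NAMES = ["Archer", "Bishop", "Castillo", "Dalton", "Eklund", "Foster", "Grant"]
STREETS = ["Oak St", "Mill Rd", "Harbor Ave", "Cedar Ln", "Union Blvd"]
CITIES = ["Springfield", "Riverton", "Lakeside", "Fairview"]


def generate_people(count, seed=0):
    """Yield `count` fake person records; the same seed always gives the same data."""
    rng = random.Random(seed)
    for person_id in range(1, count + 1):
        first = rng.choice(FIRST_NAMES)
        last = rng.choice(LAST_NAMES)
        yield {
            "id": person_id,
            "name": f"{first} {last}",
            # The id keeps e-mail addresses unique even when names repeat.
            "email": f"{first}.{last}.{person_id}@example.com".lower(),
            "address": f"{rng.randint(1, 9999)} {rng.choice(STREETS)}",
            "city": rng.choice(CITIES),
        }
```

Because it is a generator, `generate_people(150_000)` can be streamed straight into bulk inserts or a CSV writer without holding all 150K records in memory. Names will repeat with lists this short; only the id and e-mail are guaranteed unique.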


  • What tag export formats are there?

    - by Jamie Rumbelow
    I'm writing an importer for a CMS to import tags from various platforms/sources. I want to be able to import tags from WordPress, Movable Type, Blogger; basically all of the big boys. But I am also interested to see if people know of any generic, standard tag export formats that I might be able to support. Thanks, Jamie


  • How to detect Browser type in Django ?

    - by AlgoMan
    How can I detect which browser type the client is using? I have a problem where I have to ask people to use a different browser (Firefox) instead of IE. How can I get this information? I know the HTTP request has this information (in a header). How do I get the equivalent of navigator.appName from view.py in the Django framework?
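
On the server side there is no `navigator.appName`; what the view can see is the User-Agent request header, which Django exposes as `request.META["HTTP_USER_AGENT"]`. A minimal sketch of checking it follows. The helper names and the notice text are made up, the substring test for IE is a rough heuristic, and the header can be missing or spoofed, so it should be treated as a hint only.

```python
def is_internet_explorer(user_agent):
    """Rough heuristic: older IE sends 'MSIE', IE 11 sends 'Trident/' without 'MSIE'."""
    return "MSIE " in user_agent or "Trident/" in user_agent


def firefox_notice(request):
    """Return a notice for IE users, or None. `request` is a Django HttpRequest."""
    user_agent = request.META.get("HTTP_USER_AGENT", "")  # header may be absent
    if is_internet_explorer(user_agent):
        return "This site works best in Firefox."
    return None
```

Inside a view, the result of `firefox_notice(request)` could be passed into the template context and shown as a banner when it is not None.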


  • Convert Plain Text Hyperlinks into HTML Hyperlinks in PHP

    - by Volomike
    I have a simple commenting system here... http://affbuzz.com/comments/7299a55137def55917a5dc6c4fe0f261af8a4217 ...and people can submit hyperlinks inside the plain text field. When I display these records back from the database and into the web page, what RegExp in PHP can I use to convert these links into HTML-type anchor links? Bonus: For the algorithm to not do this with any other kind of link, just http and https.
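
One way to approach this is to match only `http://` and `https://` URLs, HTML-escape everything (since the comments are untrusted plain text), and wrap each match in an anchor. The sketch below does that in Python rather than PHP; the pattern itself is ordinary regex syntax and should carry over to `preg_replace` with little change, though that port is untested here. The `linkify` name and the choice to trim trailing punctuation are assumptions of this example.

```python
import html
import re

# Only http and https; the URL must start with a word character after "://".
URL_PATTERN = re.compile(r"\bhttps?://\w[^\s<>\"']*", re.IGNORECASE)


def linkify(text):
    """Escape plain text for HTML and turn http/https URLs into anchor tags."""
    parts = []
    last = 0
    for match in URL_PATTERN.finditer(text):
        # Trailing punctuation is usually part of the sentence, not the URL.
        url = match.group(0).rstrip(".,;:!?)")
        parts.append(html.escape(text[last:match.start()]))
        safe = html.escape(url)
        parts.append(f'<a href="{safe}">{safe}</a>')
        last = match.start() + len(url)
    parts.append(html.escape(text[last:]))
    return "".join(parts)
```

Other schemes such as `ftp://` are left as plain (escaped) text, which covers the bonus requirement. A URL that legitimately ends in one of the trimmed punctuation characters would be cut short, which is the trade-off of this simple rule.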


  • Users need Silverlight 4.0 for Expression Blend?

    - by Mohit Deshpande
    I have Visual Studio 2010 beta 2 installed and Expression Blend Preview for .NET 4. When I began to debug it, it asked me to install Silverlight 4.0 beta. So now I am wondering if people who are going to view my application need to install Silverlight 4.0 instead of Silverlight 3.5. If so, how can I downgrade from 4.0 to 3.5?

