Search Results

Search found 1822 results on 73 pages for 'collaborative filtering'.

Page 6/73 | < Previous Page | 2 3 4 5 6 7 8 9 10 11 12 13 | Next Page >

Naive Bayesian classification (spam filtering) - Doubt in one calculation? Which one is right? Plz c

- by Microkernel

Hi guys, I am implementing Naive Bayesian classifier for spam filtering. I have doubt on some calculation. Please clarify me what to do. Here is my question. In this method, you have to calculate P(S|W) - Probability that Message is spam given word W occurs in it. P(W|S) - Probability that word W occurs in a spam message. P(W|H) - Probability that word W occurs in a Ham message. So to calculate P(W|S), should I do (1) (Number of times W occuring in spam)/(total number of times W occurs in all the messages) OR (2) (Number of times word W occurs in Spam)/(Total number of words in the spam message) So, to calculate P(W|S), should I do (1) or (2)? (I thought it to be (2), but I am not sure, so plz clarify me) I am refering http://en.wikipedia.org/wiki/Bayesian_spam_filtering for the info by the way. I got to complete the implementation by this weekend :( Thanks and regards, MicroKernel :) @sth: Hmm... Shouldn't repeated occurrence of word 'W' increase a message's spam score? In the your approach it wouldn't, right?. Lets take a scenario and discuss... Lets say, we have 100 training messages, out of which 50 are spam and 50 are Ham. and say word_count of each message = 100. And lets say, in spam messages word W occurs 5 times in each message and word W occurs 1 time in Ham message. So total number of times W occuring in all the spam message = 5*50 = 250 times. And total number of times W occuring in all Ham messages = 1*50 = 50 times. Total occurance of W in all of the training messages = (250+50) = 300 times. So, in this scenario, how do u calculate P(W|S) and P(W|H) ? Naturally we should expect, P(W|S) P(W|H)??? right. Please share your thought...

Read the article
Collaborative kernel development

- by Raffo

Hi, I have to develop a patch for the linux kernel (2.6) for an university course I'm attending. I have to develop this patch with a friend of mine and we need to edit the same files on our computers (2-3 PCs). So, we want to use something like a collaborative editor and/or a version control system. The problem is that we never used something like that and that we cannot release our software as open source till we take the exam. Obviously, we're using linux. I'm here to ask you suggestions in order to manage our work in the best way. Thank you

Read the article
Combine multiple DNS filtering result

- by Martheen Cahya Paulo

Several DNS servers provide filtering against different categories, mostly for the local government 'undesirable content', some for malware, and some for ads. What I want is to create/use a DNS server that compare against those filtering DNS servers and only provide the address if all the DNS server agrees. For example if DNS server G provide a clean, unfiltered result, DNS server N provide malware and/or pornography filter, and DNS server F provide ad filter, then G, N and F will only provide the same/similar answer if the query is not categorized as malware, pornography, or advertising. Thus, by creating a server M that only answer if all DNS server agree, M effectively filter against all those categories. Maybe it's possible to do this on DDWRT or Tomato, but if not, I would settle for a solution that can be run in Linux/Windows.

Read the article
Creating collaborative whiteboard drawing application

- by Steven Sproat

I have my own drawing program in place, with a variety of "drawing tools" such as Pen, Eraser, Rectangle, Circle, Select, Text etc. It's made with Python and wxPython. Each tool mentioned above is a class, which all have polymorphic methods, such as left_down(), mouse_motion(), hit_test() etc. The program manages a list of all drawn shapes -- when a user has drawn a shape, it's added to the list. This is used to manage undo/redo operations too. So, I have a decent codebase that I can hook collaborative drawing into. Each shape could be changed to know its owner -- the user who drew it, and to only allow delete/move/rescale operations to be performed on shapes owned by one person. I'm just wondering the best way to develop this. One person in the "session" will have to act as the server, I have no money to offer free central servers. Somehow users will need a way to connect to servers, meaning some kind of "discover servers" browser...or something. How do I broadcast changes made to the application? Drawing in realtime and broadcasting a message on each mouse motion event would be costly in terms of performance and things get worse the more users there are at a given time. Any ideas are welcome, I'm not too sure where to begin with developing this (or even how to test it)

Read the article
How to do filtering data in UITableView like 'Recents' view in Phone application?

- by Teerasej

Hi, everyone. I am planning to applied the feature like 'Recents' view in Phone application of iPhone. This view show option on the top, 'All' and 'Missed'. Default option is 'All' but if you tap 'Missed' option the entry in UITableView which aren't 'Missed call' will slide out and disappear. This is the good feature. So I know you can help me by suggest or guide me some idea (any article is good too). I try to googling around but I couldn't find any good one.

Read the article
DDD: Trying to code sorting and filtering as it pertains to Poco, Repository, DTO, and DAO using C#?

- by Dr. Zim

I get a list of items from my Repository. Now I need to sort and filter them, which I believe would be done in the Repository for efficiency. I think there would be two ways of doing this in a DDD way: Send a filter and a sort object full of conditions to the Repository (What is this called)? Repository result would produce an object with .filter and .sort methods? (This wouldn't be a POJO/POCO because it contains more than one object?). So is the answer 1, 2, or other? Could you explain why? I am leaning toward #1 because the Repository would be able to only send the data I want (or will #2 be able to delay accessing data like a LazyList?) A code example (or website link) would be very helpful. Example: Product product = repo.GetProducts(mySortObject, myFilterObject); // List of Poco product.AddFilter("price", "lessThan", "3.99"); product.AddSort("price", "descending");

Read the article
How to use LINQ To Entities for filtering when many methods are not supported?

- by Kinderchocolate

Hi, I have a table in SQL database: ID Data Value 1 1 0.1 1 2 0.4 2 10 0.3 2 11 0.2 3 10 0.5 3 11 0.6 For each unique value in Data, I want to filter out the row with the largest ID. For example: In the table above, I want to filter out the third and fourth row because the fifth and sixth rows have the same Data values but their IDs (3) are larger (2 in the third and fourth row). I tried this in Linq to Entities: IQueryable<DerivedRate> test = ObjectContext.DerivedRates.OrderBy(d => d.Data).ThenBy(d => d.ID).SkipWhile((d, index) => (index == size - 1) || (d.ID != ObjectContext.DerivedRates.ElementAt(index + 1).ID)); Basically, I am sorting the list and removing the duplicates by checking if the next element has an identical ID. However, this doesn't work because SkipWhile(index) and ElementAt(index) aren't supported in Linq to Entities. I don't want to pull the entire gigantic table into an array before sorting it. Is there a way?

Read the article
Website misclassified by websense

- by Jeff Atwood

I received the following email from a user of one of our websites: This morning I tried to log into example.com and I was blocked by websense at work because it is considered a "social networking" site or something. I assume the websense filter is maintained by a central location, so I'm hoping that by letting you guys know you can get it unblocked. per Wikipedia, Websense is web filtering or Internet content-control software. This means one (or more) of our sites is being miscategorized by websense as "social networking" and thus disallowed for access at any workplace that uses websense to control what websites their users can and cannot access during work hours. (I know, they are monsters!) How do we dispute this websense classification error, as our websites should generally be considered "information technology" and never "social networking"? How do we know what category websense has put our sites in, so we can pro-actively make sure they're not wrong?

Read the article
Filtering at server or at client?

- by ablmf

I am thinking about how to build advertise site which works like twitter. That means, most user don't not visit the site by browser, they should run a dedicated client application on their PC or smart phone. Then they set some filters about what kind of advertise they like. And when new post that fulfill their needs appear, the client will make a notification. To make that client as real time as possible, it has to poll the server within a short time interval. The problem is, should I do the filtering at the server side when client polls, or should I simply transfer all new posts to client and let client do the filtering? Making server side filtering might cause too much CPU cycles of server, but transferring every post blindly to client might waste a lot of bandwidth. Just a brain game. :)

Read the article
glTexParameter and filtering in OpenGL and GLSL?

- by sharoz

I have a couple questions about glTexParameter and filtering 1) What is the scope when applying a glTexParameter (specifically the filtering)? Here's a scenario: Bind a texture. Set the filters to LINEAR Set the texture to "Sampler1" of a shader Bind another texture. Set its filters to NEAREST Set that texture to "Sampler2" of a shader Draw When I use the textures in a shader, will one be linear and the other be nearest? Or will they both be nearest because it was called last? 2) Is it possible to set the filtering method in GLSL? Thanks in advance!

Read the article
Web Content Filtering

- by Byron Wilcox

I have recently bought a Cisco ASA 5505 for my small business, I was initially led to believe this device could do some limited website filtering that we would need. (one list of unrestricted, and a second for email only) Since it has come to my attention that it may not be able to do this what equipment or software will I need to make this happen?

Read the article
local client http proxy for monitoring and filtering

- by jrhicks

What is a robust open source local client http proxy for monitoring and filtering? The ideal proxy could be scripted and interfaced with at run time. [UPDATE] A proxy that runs on the client machine and all browsing runs through that proxy Thanks

Read the article
Connection Filtering with Multiple DNSBL's?

- by JohnyD

Is there any advantage or disadvantage to using multiple DNSBL's with ones connection filtering? I just switched from SpamHaus to BarracudaCentral but have added in SpamHaus to test for increased effectiveness. So far there is no dip in performance and no increased load on our SMTP server.

Read the article
Can't create LVM due to: not found (or ignored by filtering)

- by James

I'm planning to use LVM for KVM, and when I try to create a VG it fails, so how can I create my VG and LV ? Thanks [root@server ~]# vgcreate virtual-machines /dev/sda Device /dev/sda not found (or ignored by filtering). Unable to add physical volume '/dev/sda' to volume group 'virtual-machines'. [root@server ~]# df -h Filesystem Size Used Avail Use% Mounted on /dev/sda3 2.0T 929G 976G 49% / tmpfs 3.9G 124K 3.9G 1% /dev/shm /dev/sda1 194M 57M 128M 31% /boot [root@server ~]# pvscan No matching physical volumes found

Read the article
Bayesian content filter for vbulletin [on hold]

- by mc0e

I've been tasked with coming up with a tool to automatically flag some posts for moderator attention on a large vbulletin forum. It's not spam per se, but the task has a lot in common with the sort of handling that might be done by a spam protection plugin (a mod in vbulletin speak). There's only so much I can say, but the task does not involve bad users, so much as particular kinds of posts which the moderators need to be aware of. Filtering out user registrations and links is therefore not useful, and we are talking about posts by real human users. What I'm looking for is an existing bayesian classification plugin, or something that I can study to get an understanding of how to do the vbulletin side of the interface in order to build such a thing. Ie I'd need ways for moderators to list flagged posts, and to correct the classification of posts which have been mis-classified. Ideally I want a 3 way split with an "unsure" category in order to reduce what has to be reviewed to find any mis-classifications. Any pointers? I've searched around a bit, and so far what I've found has been more or less entirely targetted at intervening in sign-ups (mostly using stopforumspam), captchas, and use of external services like akismet which are spam specific. I'm also considering an external solution, which might be ableto be interfaced i

Read the article
Live search/filter as you type in client approach

- by Pinoniq

As an exercise for myself to practice my JavaScript "skills" I'm trying to write a client-side filter. It should be able to filter "content blocks" as the user types. By "content block", I mean a list of DomElements that each contain at least one text node - it is possible that they contain more, and even a different amount of text nodes, nested inside other nodes, etc. I've thought of 2 approaches: On page initialization, scan all nodes and store all the text in some kind of Map or a tree. Simply iterate over every item and check whether it has the string to search/filter for. One could add performance here by caching, only filtering the current remaining items if text is added, etc. Obviously, if the number of nodes is really big, option 1 will take a while to build the 'index' but it will perform faster once it is built. Option 2 however will be available right on page load since no initialization is performed. But of course it will take longer to search. So my question is: what is the best approach here? And how would one implement 'caching' and/or 'index'?

Read the article
Dashboard for collaborative science / data processing projects

- by rescdsk

Hi, Continuous Integration servers like Hudson are a pretty amazing addition to software development. I work in an academic research lab, and I'd love to apply similar principles to scientific data analysis. I want a dashboard-like view of which collections of data are fine, which ones are failing their tests (simple shell scripts, mostly), and so on. A lot like the Chromium dashboard (WARNING: page takes a long time to load). It takes work from at least 4 people, and maybe 10 or 12 hours of computer time, to bring our data (from behavioral studies) from its raw form to its final, easily-analyzed form. I've tried Hudson and buildbot, but neither is really appropriate to our workflow. We just want to run a bunch of tests on maybe fifty independent collections of subject data, and display the results nicely. SO! Does anyone have a recommendation of a way to generate this kind of report easily? Or, can you think of a good way to shoehorn this kind of workflow into a continuous integration server? Or, can you recommend a unit testing dashboard that could deal with tests that are little shell scripts rather than little functions? Thank you!

Read the article
realtime collaborative editing: mobwrite on windows7 x64

- by collabwriter

I have set up Mobwrite on my Win7 development machine using the daemon and q.py listener. The client test suite passes, but when I run the server test suite, everything fails with this sort of response: Question: U:user10259538167863824 f:0:unittest10259538167863824 R:0:Hello world Expected: u:user10259538167863824 F:0:unittest10259538167863824 D:0:=11 Actual: u:user10259538167863824 F:0:unittest10259538167863824 D:0:=11 Diff: u:user10259538167863824 ¶ F:0:unittest10259538167863824 ¶ D:0:=11¶ ¶ ¶ I am assuming it has something to do with line endings, but I don't know what to do. Can anyone shed some light on this? Thanks so much! PS: I'm running Python 2.7

Read the article
GWT Table that supports dynamic filtering

- by Holograham

This question is similar to http://stackoverflow.com/questions/161686/gwt-table-that-supports-sorting-scrolling-and-filtering However I would prefer open source and I am looking for snappy performance. I want a good way to perform dynamic filtering on rows. SmartGWT's adaptive filter looks interesting. http://www.smartclient.com/smartgwt/showcase/#grid_adaptive_filter_featured_category Anyone have any experience with this?

Read the article
Filtering code elements when analyzing source code.

- by Martin

Hi everybody, Currently I am making a survey about source code analysis and the thing that puzzles me greatly is what is it that project managers and developers would like to filter when analyzing source code (especially when applying OOP metrics - e.g. skpping insignificant methods and classes during analysis or filtering context-based elements according to the type of project). If you have any suggestions based on yout experience with code analysis I will greatly appreciate if you can share some ideas about filtering of elements. Thanks, Martin

Read the article
Collaborative text editing in a web application

- by UmYeah

I have a group of writers around the world all trying to work on one story. I want them to be able to work with each other like they would on google documents, but from within our application. Specifically, being able to see who else is working on the document and what they are changing in real time. Something like an embedded etherpad. Are there any good solutions out in the wild? Im sure building something like this would be a significant project.

Read the article
looking for django app for collaborative Wiki

- by PlanetUnknown

I've been trying to find a django wiki app, which has the following - 1.) WYSIWYG 2.) Attach files 3.) Revisions I see moinmoin, but before going all in, wanted to see what you all have used.

Read the article
Filtering with SimpleCursorTreeAdapter

- by Aleksander O

I'm trying to implement filtering using ExpandableListView and SimpleCursorTreeAdapter (its subclass). How can I specify which data (maybe cursor field or TextView text) to use for filtering? Thanks

Read the article
Collaborative Latex

- by fabrizioM

I would like to insert notes on the fixes to be done in specific places of my latex document and maybe have a list of "todos/fixme" How do you handle this? Seems one way is to use the fixme package, but I couldn't make it work. Is Anyone using it?

Read the article
Filtering junk mail in Outlook 2007

- by Faruz

I keep getting mail which I try to filter as junk. Problem is, it is sent from my e-mail address while writing that the name of the sender is "Viagra yada yada yada ([email protected])" I can't add a rule to outlook according to sender's name, right? Is there any other way I can filter those e-mails (I can't add a rule to junk-mail my own e-mail address). The subject of the e-mails keeps changing in the following formats: "Faruz 76% discount now!" "Faruz 80% discount now!" "Now! 78% discount Faruz" (They seem to have infinite ideas of saying the same thing... :) ) So I also seem to have problems filtering according to subject. ideas?

Read the article

< Previous Page | 2 3 4 5 6 7 8 9 10 11 12 13 | Next Page >