Search Results

Search found 17 results on 1 pages for 'mahout'.

Page 1/1 | 1

Mahout - Error when try out wikipedia exmaples

- by Li'

Note this post is similar to Caused by: java.lang.ClassNotFoundException: classpath but different error message. When I try to run Wikipedia Bayes Example from https://cwiki.apache.org/confluence/display/MAHOUT/Wikipedia+Bayes+Example When I ran the following command : lis-macbook-pro:mahout-distribution-0.8 Li$ mahout wikipediaXMLSplitter…

Read the article
Is it worth purchasing Mahout in Action to get up to speed with Mahout, or are there other better sources?

- by gab

I'm currently a very casual user of Apache Mahout, and I'm considering purchasing the book Mahout in Action. Unfortunately, I'm having a really hard time getting an idea of how worth it this book is -- and seeing as it's a Manning Early Access Program book (and therefore only currently available as a beta-version e-book), I can't take a look…

Read the article
Mahout - Clustering - "naming" the cluster elements

- by Mark Bramnik

I'm doing some research and I'm playing with Apache Mahout 0.6 My purpose is to build a system which will name different categories of documents based on user input. The documents are not known in advance and I don't know also which categories do I have while collecting these documents. But I do know, that all the documents in the model…

Read the article
ClassNotFoundException error in implementing Bayesian algorithm in Apache Mahout on Hadoop

- by Shweta

Hi, I have a problem in executing the Bayesian algorithm in Mahout. I built it with Maven and the job file is in target directory. When run from terminal using hadoop, I'm getting the ClassNotFoundException error. What should be done? $HADOOP_HOME/bin/hadoop jar mahout-core-0.3-SNAPSHOT.job…

Read the article
Apache Mahout Advices?

- by MarySheen

Have you implemented Apache Mahout recommendation engine? any advices you can share? any other sites you know that use Mahout? thanks!

Read the article
identify documents from results of mahout clustering

- by Tejas

I am using mahout to cluster text documents indexed using solr. I have used the "text" field in the document to form vectors. Then I used the k-means driver in mahout for clustering and then the clusterdumper utility to dump the results. I am having difficulty in understanding the output…

Read the article
How to use Mahout in a Windows environment?

- by oopdemo

I am trying to use Mahout in an application running on Windows. I want to build clusters from a lucene index using k-means. As soon as I have to create sequence files (creating vectors from a lucene index), I get a Hadoop-Exception, since Hadoop makes command line calls to programs unknown…

Read the article
Mahout Naive Bayes Classifier for Items

- by Nimesh Parikh

Team, I am working on a project where i need to classify Items into certain category. I have a single file as input; which contains target variable and space separated features. My training data will look like Category Name [Tab] DataString Plumbing [Tab] Pipe Tap Plastic Pipe PVC Pipe…

Read the article
Mahout Recommendations on Binary data

- by Pranay Kumar

Hi, I'm a newbie to mahout.My aim is to produce recommendations on binary user purchased data.So i applied item-item similarity model in computing top N recommendations for movie lens data assuming 1-3 ratings as a 0 and 4-5 ratings as a 1.Then i tried evaluating my recommendations with…

Read the article
Colloborative filtering

- by Pranay Kumar

How can i use SVD algorithm in mahout for producing recommendations on explicit binary data-set (eg. a user purchased or not but no specific ratings ) in an e-commerce domain ? Also what algorithms aim at producing recommendations on such binary data-sets ? Thanks in advance. Pranay…

Read the article
Do you know of any python mapreduce ready clustering libraries?

- by user319045

Do you know of any python mapreduce ready clustering libraries? I have found some good libraries in Java (http://lucene.apache.org/mahout/), I'd prefer to use python though.

Read the article
How to remove large number of files/folders in linux

- by user1745713

We are using hadoop to split a table into smaller files to feed to mahout, but in the process, we created a huge amount of _temporary logs. we have an nfs mount for the hadoop volume so we can use all the linux commands to delete folders files, but we just can't get them to be deleted,…

Read the article
What's the best C# recommendation engine or framework?

- by cDima

Is there anyway to use the examples for the "My Media" Microsoft research project? My Media is a "dynamic personalization and recommendation software framework toolkit" ( http://www.mymediaproject.org ), but out of the box it doesn't provide a sample database (only a LINQ-to-SQL .dbml…

Read the article
KMeans clustering for more than 5 million vectors

- by Wajih

I have hit a real problem. I need to do some Kmeans clustering for 5 million vectors, each containing about 32 cols. I tried out Mahout which requires linux and I am on windows, I am restrained from using a Linux OS and any sort of simulator. Can anyone suggest a KMeans clustering…

Read the article
Windows Azure Recipe: Big Data

- by Clint Edmonson

As the name implies, what we’re talking about here is the explosion of electronic data that comes from huge volumes of transactions, devices, and sensors being captured by businesses today. This data often comes in unstructured formats and/or too fast for us to effectively process in…

Read the article
The Buzz at the JavaOne Bookstore

- by Janice J. Heiss

I found my way to the JavaOne bookstore, a hub of activity. Who says brick and mortar bookstores are dead? I asked what was hot and got two answers: Hadoop in Practice by Alex Holmes was doing well. And Scala for the Impatient by noted Java Champion Cay Horstmann also seemed to be a…

Read the article
Big Data – Various Learning Resources – How to Start with Big Data? – Day 20 of 21

- by Pinal Dave

In yesterday’s blog post we learned how to become a Data Scientist for Big Data. In this article we will go over various learning resources related to Big Data. In this series we have covered many of the most essential details about Big Data. At the beginning of this series, I have…

Read the article

1