What is a "task" in Twitter Storm parallelism?

Posted by John Wang on Stack Overflow
Published on 2013-06-23T03:23:18Z Indexed on 2013/10/19 3:55 UTC

I'm trying to learn Twitter Storm by following the great article "Understanding the parallelism of a Storm topology".

However, I'm a bit confused by the concept of a "task". Is a task a running instance of a component (spout or bolt)? If an executor has multiple tasks, does that mean the same component is executed multiple times by that executor? Am I correct?

Moreover, in a general parallelism sense, Storm spawns a dedicated thread (executor) for a spout or bolt, but what does an executor (thread) with multiple tasks contribute to parallelism? Since a thread executes sequentially, I think having multiple tasks in one thread only makes the thread a kind of "cached" resource, which avoids spawning a new thread for the next task run. Am I correct?
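
To make the question concrete, here is a minimal plain-Java sketch of how I picture one executor serving several tasks. It does not use the Storm API and is not Storm's actual implementation: the names ExecutorSketch, Task and runExecutor are hypothetical, and routing tuples to tasks by index is only an assumption for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ExecutorSketch {

    /** Hypothetical stand-in for one task: an instance of a bolt with its own state. */
    static class Task {
        final int id;
        int processed = 0;

        Task(int id) { this.id = id; }

        String execute(String tuple) {
            processed++;
            return "task-" + id + " handled " + tuple;
        }
    }

    /** One executor is modeled as one thread that serves all of its tasks in turn. */
    static List<String> runExecutor(List<Task> tasks, List<String> tuples)
            throws InterruptedException {
        List<String> log = new ArrayList<>();
        Thread executor = new Thread(() -> {
            for (int i = 0; i < tuples.size(); i++) {
                // Routing by index is an illustrative assumption, not Storm's grouping logic.
                Task target = tasks.get(i % tasks.size());
                log.add(target.execute(tuples.get(i)));
            }
        });
        executor.start();
        executor.join(); // wait for the single executor thread to finish
        return log;
    }

    public static void main(String[] args) throws InterruptedException {
        List<Task> tasks = Arrays.asList(new Task(0), new Task(1));
        for (String line : runExecutor(tasks, Arrays.asList("a", "b", "c", "d"))) {
            System.out.println(line);
        }
    }
}
```

In this model the two tasks never run at the same time, because a single thread calls them one after another. Each task keeps its own state (the processed counter), but adding tasks to the same thread adds no parallelism by itself, which is exactly the part I would like confirmed or corrected.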

I could probably clear up this confusion myself after taking more time to investigate, but you know, we both love Stack Overflow ;-)

Thanks in advance.

© Stack Overflow or respective owner
