Efficient job progress update in web application

Posted by Endru6 on Stack Overflow See other posts from Stack Overflow or by Endru6
Published on 2010-05-18T07:35:52Z Indexed on 2010/05/18 7:40 UTC
Read the original article Hit count: 158

Hi,

Creating a web application (Django in my case, but I think the question is more general) that is administrating a cluster of workers doing queued jobs, there is a need to track each jobs progress.

When I've done it using database UPDATE (PostgreSQL in this case), it severely hits the database performance, because each UPDATE creates a new row in a table, and in my case only vacuuming DB removes obsolete rows. Having 30 jobs running and reporting progress every 1 minute DB may require vacuuming (and it means huge slow downs on a front end side for all the employees working with the system) every 10 days.

Because the progress information isn't critical, ie. it doesn't have to be persistent, how would you do the progress updates from jobs without using an overhead database implies? There are 30 worker servers, each doing 1 or 2 jobs simultaneously, 1 front end server which serves a web application to users, and 1 database server.

© Stack Overflow or respective owner

Related posts about web-applications

Related posts about database