We present ClowdFlows, a cloud-based platform for workflow-driven big data analysis and stream processing. The platform supports processing of multiple concurrent data streams. For batch analysis of big data, several machine learning algorithms were implemented in the map-reduce paradigm. We show that learning from the full data set in distributed mode gives better results than learning from a subset of the data in non-distributed mode, and that the platform handles big data sets with nearly linear speedup.
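To illustrate what implementing a learning algorithm in the map-reduce paradigm can look like, the following is a minimal sketch and not code from the ClowdFlows platform. It assumes a simple one-dimensional least-squares fit; the function names `map_chunk`, `combine`, and `fit_line` are hypothetical. Each partition is summarised independently (map), the summaries are added together (reduce), and the model is computed from the combined totals.

```python
from functools import reduce


def map_chunk(chunk):
    """Map step: summarise one data partition as sufficient statistics."""
    n = len(chunk)
    sx = sum(x for x, _ in chunk)
    sy = sum(y for _, y in chunk)
    sxx = sum(x * x for x, _ in chunk)
    sxy = sum(x * y for x, y in chunk)
    return (n, sx, sy, sxx, sxy)


def combine(a, b):
    """Reduce step: add two partition summaries element-wise."""
    return tuple(u + v for u, v in zip(a, b))


def fit_line(chunks):
    """Fit y = slope * x + intercept from per-partition summaries."""
    n, sx, sy, sxx, sxy = reduce(combine, map(map_chunk, chunks))
    slope = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    intercept = (sy - slope * sx) / n
    return slope, intercept


# Toy data (illustrative only): points on the line y = 2x + 1,
# split into three partitions as a stand-in for distributed storage.
data = [(x, 2 * x + 1) for x in range(10)]
chunks = [data[0:3], data[3:7], data[7:10]]
slope, intercept = fit_line(chunks)
```

Because the summaries are combined by addition, the fitted model does not depend on how the data is partitioned, so the map step can in principle run on separate workers.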