Scalable and Efficient Big Data Pipeline Architecture for Machine Learning