Ask HN: What does your ML pipeline look like?

As a DevOps Engineer working for a ML-based company and have had worked for others in the past, these are my quick suggestions for production readiness.

Can you elaborate on why downloading from S3 at startup is a bad idea? And why not synchronous everywhere as opposed to always queues?

Containers are meant to be stateless infrastructure. By downloading something at startup, you’re breaking that contract implicitly. Secondly, depending on where you’re deploying, downloads from S3 (and then loading to memory) may take a non-negligible amount of time that can impact the availability of your pods (again, depending on their configuration).

The one I’m working with _now_ is very low tech: daily Python processing data from GCP, and writing back to GCP; a handful of scripts that check everything is reasonnable. That’s because we serve internal results, mostly read by humans.

I see this as a very timely question. As ML has proliferated, so has the number of ways to construct machine learning pipelines. This means that from one project to another the tools/libraries change, the codebases start looking very different, and each project gets its own unique deployment and evaluation process.

Would be interested if anyone in here has a pipeline operating on regulated data (HIPAA, financial, etc). Having a hard time drawing boundaries around what the data science team has access to for development and experimentation vs. where production pipelines would operate. (e.g. where in the process do people/processes get access to live data)

This is great, thanks for the link.
Could you expand on how this workflow be different/better than sticking to just something like TFX and tensorflow serving? Is it easier to use or more scalable?

It is pretty much the same as TFX – but with Spark for both DataPrep and Distributed HyperparamOpt/Training, and a Feature Store. Model serving is slightly more sophisticated than just TensorFlow Serving on Kubernetes. We support serving requests through the Hopsworks REST API to TFServering/Kubernetes. This gives us both access control (clients have a TLS cert to authenticate and authorize themselves) and we log all predictions to a Kafka topic. We are adding support to enrich feature vectors using the Feature Store in the serving API, not quite there yet.

Here’s a framework we’ve been developing for this purpose, delivered as a python cookiecutter:

Depends on what you’re trying to do.

Thanks for the reply. Could you give some more insight into how and what tools you choose for the different sort of tasks (say NLP vs CV vs RL)? Also, how and why are different tools/pipelines better for production and product building?

Ask HN: What does your ML pipeline look like?

Leave a Reply Cancel reply

Next Post

Simple cooking methods flush arsenic out of rice

Breaking News

Quality Information on Smell Proof Bags

THE BEST UPCOMING MOVIES 2021 (New Trailers)

The 50 best teen shows of all time

Millions of Americans could be grounded from flying because of REAL ID deadline

NASAs Perseverance rover successfully makes oxygen on Mars

Top 10 Deadly Climate Change Predictions

The Climate Is Changing

Categories

Leave a Reply Cancel reply

You May Like

Breaking News

Categories