Skip to content

Latest commit

 

History

History
96 lines (65 loc) · 3.82 KB

03-preprocessing.md

File metadata and controls

96 lines (65 loc) · 3.82 KB

10.3 Creating a pre-processing service

In the previous section we created jupyter notebook to communicate with the model deployed with tensorflow. This notebook fetches an image, pre-processes it, turns it into protobuf, sends it to tensorflow-serving, does post-processing, and finally gives a human-readable answer.

In this section we convert the notebook into python script to build a flask application. To convert the notebook into script we can run the command jupyter nbconvert --to script notebook.ipynb. We also rename the script to gateway.py.

Then we create functions to prepare a request, send it, and prepare the response. For the flask app, we can reuse the code from session 5:

# Create flask app
app = Flask('gateway')

@app.route('/predict', methods=['POST'])
def predict_endpoint():
    data = request.get_json()
    url = data['url']
    result = predict(url)
    return jsonify(result)

Our application has two components: docker container with tensorflow serving and flask application with the gateway.

We also want to put everything in the pipenv for deployment. For that we need to install few libraries with pipenv: pipenv install grpcio==1.42.0 flask gunicorn keras-image-helper.

As we discussed tensorflow is a large library and we don't want to use it in our application. Instead we can use the following script to convert numpy array into protobuf format and import the np_to_protobuf function into our gateway.py script. In order the make the script work we need to install the following libraries as well pipenv install tensorflow-protobuf==2.7.0 protobuf==3.19:

from tensorflow.core.framework import tensor_pb2, tensor_shape_pb2, types_pb2


def dtypes_as_dtype(dtype):
    if dtype == "float32":
        return types_pb2.DT_FLOAT
    raise Exception("dtype %s is not supported" % dtype)


def make_tensor_proto(data):
    shape = data.shape
    dims = [tensor_shape_pb2.TensorShapeProto.Dim(size=i) for i in shape]
    proto_shape = tensor_shape_pb2.TensorShapeProto(dim=dims)

    proto_dtype = dtypes_as_dtype(data.dtype)

    tensor_proto = tensor_pb2.TensorProto(dtype=proto_dtype, tensor_shape=proto_shape)
    tensor_proto.tensor_content = data.tostring()

    return tensor_proto


def np_to_protobuf(data):
    if data.dtype != "float32":
        data = data.astype("float32")
    return make_tensor_proto(data)

Links

Notes

Add notes from the video (PRs are welcome)

  • turn jupyter notebook into flask app
  • the notebook communicates with the model deployed with tensorflow
  • the notebook fetches an image, pre-processes it, turns it into protobuf, sends it to tensorflow-serving, does post-processing and finally gives a human-readable answer
  • convert notebook into python script and call the script gateway
  • prepare request, send request, prepare response
  • you can reuse the flask app code from session 5
  • two components: docker container with tensorflow serving and flask application with the gateway
  • be aware of the library sizes: tensorflow 1.7 GB, tensorflow CPU ~400 MB, tensorflow serving
  • turn numpy array into protobuf format
  • tensorflow protobuf
⚠️ The notes are written by the community.
If you see an error here, please create a PR with a fix.

Navigation