How to get the body from a Word DOCX document in Python
1 min readMay 14, 2020
Extracting the data from the body portion of a DOCX file is just such an exciting process that I’m sure you can’t wait to spend your afternoon setting it up. Just kidding, I’m about to give you a shortcut that will deliver you from the tedium!
The API client that we need can be installed using pip install, as seen here.
pip install cloudmersive-convert-api-client
The next step is to call edit_document_docx_body from an API instance. This process is illustrated in the following example:
from __future__ import print_functionimport timeimport cloudmersive_convert_api_clientfrom cloudmersive_convert_api_client.rest import ApiExceptionfrom pprint import pprint# Configure API key authorization: Apikeyconfiguration = cloudmersive_convert_api_client.Configuration()configuration.api_key['Apikey'] = 'YOUR_API_KEY'# Uncomment below to setup prefix (e.g. Bearer) for API key, if needed# configuration.api_key_prefix['Apikey'] = 'Bearer'# create an instance of the API classapi_instance = cloudmersive_convert_api_client.EditDocumentApi(cloudmersive_convert_api_client.ApiClient(configuration))req_config = cloudmersive_convert_api_client.GetDocxBodyRequest() # GetDocxBodyRequest | Document input requesttry:# Get body from a Word DOCX documentapi_response = api_instance.edit_document_docx_body(req_config)pprint(api_response)except ApiException as e:print("Exception when calling EditDocumentApi->edit_document_docx_body: %s\n" % e)
Well that was easy. Go get yourself a coffee to celebrate all the boring work that you don’t have to do now.