Skip to content

A simple Salesforce.com bulk API client for Python 3

License

Notifications You must be signed in to change notification settings

donaldrauscher/sfdc-bulk

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

sfdc-bulk

A simple Salesforce.com bulk API client for Python 3

This library provides some simple functions for interacting with Salesforce.com's bulk API, a great tool for updating large amounts of data in Salesforce.

The big difference between working with the REST API and working with the bulk API is that the bulk API is asynchronous. You create a job, add one or more batches to that job, close the job, and wait for it to complete. Once complete, you download the results (in pieces). So a single 'operation' requires several HTTP calls to the bulk API and a rather annoying process of stitching together results. This library provides a few methods to simplify this workflow. Inputs and outputs are Pandas dataframes.

To avoid reinventing the wheel, I leveraged the SalesforceLogin class in the simple_salesforce package for authentication.

Package is available on PyPI and can be installed with pip install sfdc_bulk.

Main Methods

Here are the key methods for bulk API queries:

  • query: creates a query job and submits a SOQL query as a batch to that job
  • get_all_query_results: downloads results from a query job into a Pandas dataframe

Here are the key methods for bulk API jobs:

  • create_job: creates a job (operations types include 'insert', 'upsert , 'update', 'delete', and 'hardDelete')
  • bulk_csv_operation: breaks Pandas dataframe into chunks and adds each chunk as a batches to a job
  • get_bulk_csv_operation_results: downloads results from a bulk CSV job into a Pandas dataframe

Example

# set up our SF object
bulk = SalesforceBulkAPI(**sfdc_credentials)

# pull down some records into a pandas df
query_job = bulk.query('SELECT Id, Company FROM Lead LIMIT 10000')
some_records = bulk.get_all_query_results(query_job)

# make an update
update = some_records[some_records.Company == "Blockbuster Video"]
update['DoNotCall'] = 1

# push to SFDC
update_job = bulk.create_update_job(object='Lead', contentType='CSV')
bulk.bulk_csv_operation(update_job, update)
update_results = bulk.get_bulk_csv_operation_results(update_job)

About

A simple Salesforce.com bulk API client for Python 3

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages