{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Python API Example - Access Slots Import and Storage in Dataframe\n", "\n", "This guide is designed to provide an example of how to access the Spark API:\n", "- The path to your client credentials is the only input needed to run this script (just before Section 2)\n", "- This script has been designed to display the raw outputs of requests from the API\n", "- This script can be copied and pasted by customers for quick use of the API\n", "\n", "__N.B. This guide is just for Access terminal slots data. If you're looking for other API data products (such as contract prices, Freight routes or Netbacks), please refer to their according code example files.__ " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Have any questions?\n", "\n", "If you have any questions regarding our API, or need help accessing specific datasets, please contact us at:\n", "\n", "__data@sparkcommodities.com__\n", "\n", "or refer to our API website for more information about this endpoint: https://www.sparkcommodities.com/api/lng-access/slot-availability.html" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 1. Importing Data\n", "\n", "Here we define the functions that allow us to retrieve the valid credentials to access the Spark API.\n", "\n", "This section can remain unchanged for most Spark API users." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import json\n", "import os\n", "import sys\n", "import pandas as pd\n", "import numpy as np\n", "from base64 import b64encode\n", "from pprint import pprint\n", "from urllib.parse import urljoin\n", "import datetime\n", "import time\n", "from io import StringIO\n", "import requests\n", "\n", "\n", "try:\n", " from urllib import request, parse\n", " from urllib.error import HTTPError\n", "except ImportError:\n", " raise RuntimeError(\"Python 3 required\")\n", "\n", "\n", "API_BASE_URL = \"https://api.sparkcommodities.com\"\n", "\n", "\n", "def retrieve_credentials(file_path=None):\n", " \"\"\"\n", " Find credentials either by reading the client_credentials file or reading\n", " environment variables\n", " \"\"\"\n", " if file_path is None:\n", " client_id = os.getenv(\"SPARK_CLIENT_ID\")\n", " client_secret = os.getenv(\"SPARK_CLIENT_SECRET\")\n", " if not client_id or not client_secret:\n", " raise RuntimeError(\n", " \"SPARK_CLIENT_ID and SPARK_CLIENT_SECRET environment vars required\"\n", " )\n", " else:\n", " # Parse the file\n", " if not os.path.isfile(file_path):\n", " raise RuntimeError(\"The file {} doesn't exist\".format(file_path))\n", "\n", " with open(file_path) as fp:\n", " lines = [l.replace(\"\\n\", \"\") for l in fp.readlines()]\n", "\n", " if lines[0] in (\"clientId,clientSecret\", \"client_id,client_secret\"):\n", " client_id, client_secret = lines[1].split(\",\")\n", " else:\n", " print(\"First line read: '{}'\".format(lines[0]))\n", " raise RuntimeError(\n", " \"The specified file {} doesn't look like to be a Spark API client \"\n", " \"credentials file\".format(file_path)\n", " )\n", "\n", " print(\">>>> Found credentials!\")\n", " print(\n", " \">>>> Client_id={}****, client_secret={}****\".format(client_id[:3], client_secret[:5])\n", " )\n", "\n", " return client_id, client_secret\n", "\n", "\n", "def do_api_post_query(uri, body, headers):\n", " url = urljoin(API_BASE_URL, uri)\n", "\n", " data = json.dumps(body).encode(\"utf-8\")\n", "\n", " # HTTP POST request\n", " req = request.Request(url, data=data, headers=headers)\n", " try:\n", " response = request.urlopen(req)\n", " except HTTPError as e:\n", " print(\"HTTP Error: \", e.code)\n", " print(e.read())\n", " sys.exit(1)\n", "\n", " resp_content = response.read()\n", "\n", " # The server must return HTTP 201. Raise an error if this is not the case\n", " assert response.status == 201, resp_content\n", "\n", " # The server returned a JSON response\n", " content = json.loads(resp_content)\n", "\n", " return content\n", "\n", "\n", "def do_api_get_query(uri, access_token, format='json'):\n", " \"\"\"\n", " After receiving an Access Token, we can request information from the API.\n", " \"\"\"\n", " url = urljoin(API_BASE_URL, uri)\n", "\n", " if format == 'json':\n", " headers = {\n", " \"Authorization\": \"Bearer {}\".format(access_token),\n", " \"Accept\": \"application/json\",\n", " }\n", " elif format == 'csv':\n", " headers = {\n", " \"Authorization\": \"Bearer {}\".format(access_token),\n", " \"Accept\": \"text/csv\"\n", " }\n", " else:\n", " raise ValueError('The format parameter only takes `csv` or `json` as inputs')\n", "\n", " # HTTP GET request\n", " req = request.Request(url, headers=headers)\n", " try:\n", " response = request.urlopen(req)\n", " except HTTPError as e:\n", " print(\"HTTP Error: \", e.code)\n", " print(e.read())\n", " sys.exit(1)\n", "\n", " resp_content = response.read()\n", " #status = response.status\n", "\n", " # The server must return HTTP 200. Raise an error if this is not the case\n", " assert response.status == 200, resp_content\n", "\n", " # Storing response based on requested format\n", " if format == 'json':\n", " content = json.loads(resp_content)\n", " elif format == 'csv':\n", " content = resp_content\n", "\n", " return content\n", "\n", "\n", "def get_access_token(client_id, client_secret):\n", " \"\"\"\n", " Get a new access_token. Access tokens are the thing that applications use to make\n", " API requests. Access tokens must be kept confidential in storage.\n", "\n", " # Procedure:\n", "\n", " Do a POST query with `grantType` in the body. A basic authorization\n", " HTTP header is required. The \"Basic\" HTTP authentication scheme is defined in\n", " RFC 7617, which transmits credentials as `clientId:clientSecret` pairs, encoded\n", " using base64.\n", " \"\"\"\n", "\n", " # Note: for the sake of this example, we choose to use the Python urllib from the\n", " # standard lib. One should consider using https://requests.readthedocs.io/\n", "\n", " payload = \"{}:{}\".format(client_id, client_secret).encode()\n", " headers = {\n", " \"Authorization\": \"Basic {}\".format(b64encode(payload).decode()),\n", " \"Accept\": \"application/json\",\n", " \"Content-Type\": \"application/json\",\n", " }\n", " body = {\n", " \"grantType\": \"clientCredentials\",\n", " }\n", "\n", " content = do_api_post_query(uri=\"/oauth/token/\", body=body, headers=headers)\n", "\n", " print(\n", " \">>>> Successfully fetched an access token {}****, valid {} seconds.\".format(\n", " content[\"accessToken\"][:5], content[\"expiresIn\"]\n", " )\n", " )\n", "\n", " return content[\"accessToken\"]\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## N.B. Credentials\n", "\n", "Here we call the above functions, and input the file path to our credentials.\n", "\n", "N.B. You must have downloaded your client credentials CSV file before proceeding. Please refer to the API documentation if you have not dowloaded them already. Instructions for downloading your credentials can be found here:\n", "\n", "https://www.sparkcommodities.com/api/request/authentication.html\n" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Insert file path to your client credentials here\n", "client_id, client_secret = retrieve_credentials(file_path=\"/tmp/client_credentials.csv\")\n", "\n", "# Authenticate:\n", "access_token = get_access_token(client_id, client_secret)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 2. Latest Slot Release\n", "\n", "Here we call the latest slot release and print it in a readable format. This is done using the URL:\n", "\n", "__/beta/terminal-slots/releases/latest/__\n", "\n", "\n", "We then save the entire dataset as a local variable called `latest`." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Function to get the latest slot release\n", "def get_latest_slots():\n", " uri = urljoin(API_BASE_URL,'/beta/terminal-slots/releases/latest/')\n", " headers = {\n", " \"Authorization\": \"Bearer {}\".format(access_token),\n", " \"accept\": \"text/csv\"\n", " }\n", " response = requests.get(uri, headers=headers)\n", " if response.status_code == 200:\n", " df = response.content.decode('utf-8')\n", " df = pd.read_csv(StringIO(df))\n", " else:\n", " print('Bad Request')\n", " return df" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Call latest slots function\n", "latest = get_latest_slots()\n", "latest.head()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 3. Slot Release by Release Date\n", "\n", "Here we call the slot data by choosing a specific date and print it in a readable format. This is done using the URL:\n", "\n", "__/beta/terminal-slots/releases/{date}/__ where __date__ is the release date, in the \"YYYY-MM-DD\" string format.\n", "\n", "\n", "We then save the entire dataset as a local variable called `release_df`." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Function to get the slot releases for a specific date\n", "def get_slot_releases(date):\n", " uri = urljoin(API_BASE_URL, f'/beta/terminal-slots/releases/{date}/')\n", " headers = {\n", " \"Authorization\": \"Bearer {}\".format(access_token),\n", " \"accept\": \"text/csv\"\n", " }\n", "\n", " response = requests.get(uri, headers=headers)\n", "\n", " if response.status_code == 200:\n", " df = response.content.decode('utf-8')\n", " df = pd.read_csv(StringIO(df))\n", " return df\n", "\n", " elif response.content == b'{\"errors\":[{\"code\":\"object_not_found\",\"detail\":\"Object not found\"}]}':\n", " print('Bad Date')\n", " return None\n", " \n", " else:\n", " print('Bad Request')\n", " return None" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### N.B. Plan Limits\n", "\n", "__Premium__ Users can choose any release date, as they have full access to the dataset.\n", "\n", "__Trial__ Users must choose a release date within the last 2 weeks." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Calling slot release function\n", "release_df = get_slot_releases(\"2024-10-22\")\n", "release_df.head()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 4. Terminal List\n", "\n", "Here we call the list of terminals and their uuid, and print it in a readable format. This is done using the URL:\n", "\n", "__'beta/terminal-slots/terminals/'__\n", "\n", "\n", "We then save the entire dataset as a local variable called `terminal_list`." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Function to get the list of terminals and their uuids (as well as their start and latest release date)\n", "def get_terminal_list():\n", " uri = urljoin(API_BASE_URL,'beta/terminal-slots/terminals/')\n", " headers = {\n", " \"Authorization\": \"Bearer {}\".format(access_token),\n", " \"accept\": \"text/csv\"\n", " }\n", " response = requests.get(uri, headers=headers)\n", " if response.status_code == 200:\n", " df = response.content.decode('utf-8')\n", " df = pd.read_csv(StringIO(df))\n", " else:\n", " print('Bad Request')\n", " return df" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Call terminal list function\n", "terminal_list = get_terminal_list()\n", "terminal_list.head()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Fetching Slots Data specific to one terminal\n", "\n", "Now that we can see all the terminal data available to us, we can start to define what terminal we want to call slots data for (by referring to `terminal_list` above).\n", "\n", "The first step is to choose which terminal uuid (`my_uuid`), then the request will return all the historical data available for that terminal." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Function to collect and store historical slots for one specific terminal\n", "def get_individual_terminal(terminal_uuid):\n", " uri = urljoin(API_BASE_URL, f'/beta/terminal-slots/terminals/{terminal_uuid}/')\n", " headers = {\n", " \"Authorization\": \"Bearer {}\".format(access_token),\n", " \"accept\": \"text/csv\"\n", " }\n", " response = requests.get(uri, headers=headers)\n", " if response.status_code == 200:\n", " df = response.content.decode('utf-8')\n", " df = pd.read_csv(StringIO(df))\n", " return df\n", "\n", " elif response.content == b'{\"errors\":[{\"code\":\"object_not_found\",\"detail\":\"Object not found\"}]}':\n", " print('Bad Terminal Request')\n", " return None\n", " else:\n", " print('Bad Request')\n", " return None" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Call individual terminal function (Dragon)\n", "my_uuid = terminal_list[terminal_list['TerminalName']=='Dragon']['TerminalUUID'].tolist()[0]\n", "dragon_hist_df = get_individual_terminal(my_uuid)\n", "dragon_hist_df" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 5. Historical Slots\n", "\n", "Here we collect all the historical slots for each available terminal. This is done using the URL:\n", "\n", "__/beta/terminal-slots/terminals/{terminal_uuid}/__\n", "\n", "First we define the function that imports the data from the Spark API.\n", "\n", "We then call that function, and define the list of terminal uuids.\n", "\n", "\n", "We save the output as a local variable called `all_terminals_historical`.\n", "\n", "### N.B. Plan Limits\n", "\n", "__Premium__ Users have full access to the dataset.\n", "\n", "__Trial__ Users only have access the latest 2 weeks' worth of datapoints." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Function to collect and store each terminal's historical slots data\n", "def get_all_terminal_data(terminal_list):\n", " terminals_all = pd.DataFrame()\n", " for i in range(len(terminal_list)):\n", " print(terminal_list['TerminalName'].loc[i])\n", " terminal_df = get_individual_terminal(terminal_list['TerminalUUID'].loc[i])\n", " time.sleep(0.1)\n", " terminals_all = pd.concat([terminals_all,terminal_df])\n", " return terminals_all" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Calling all terminal data function\n", "all_terminal_historical = get_all_terminal_data(terminal_list)\n", "all_terminal_historical.head()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Analytics Gallery\n", "Want to gain market insights using our data?\n", "\n", "Take a look at our [Analytics Gallery](https://www.sparkcommodities.com/api/code-examples/analytics-examples.html) on the Spark API website, which includes\n", "\n", "- __Regas Slots vs US Arb__ - See how the US Arb (M+1, M+2, M+3) correlates with monthly average slots for all European Terminals.\n", "__Note: Requires Cargo and Access subscriptions__\n", "\n", "\n", "Take a look at our Regas Slots vs US Arb chart [here](https://www.sparkcommodities.com/api/code-examples/analytics-examples.html). " ] } ], "metadata": { "kernelspec": { "display_name": "base", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.11.5" } }, "nbformat": 4, "nbformat_minor": 2 }