Python Library Caching
Nevar pievienot vairāk kā 25 tēmas Tēmai ir jāsākas ar burtu vai ciparu, tā var saturēt domu zīmes ('-') un var būt līdz 35 simboliem gara.

__init__.py 12KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292
  1. #!/usr/bin/env python
  2. # -*- coding: utf-8 -*-
  3. #
  4. """
  5. caching (Caching Module)
  6. ========================
  7. **Author:**
  8. * Dirk Alders <sudo-dirk@mount-mockery.de>
  9. **Description:**
  10. This Module supports functions and classes for caching e.g. properties of other instances.
  11. **Submodules:**
  12. * :class:`caching.property_cache_json`
  13. * :class:`caching.property_cache_pickle`
  14. **Unittest:**
  15. See also the :download:`unittest <caching/_testresults_/unittest.pdf>` documentation.
  16. """
  17. __DEPENDENCIES__ = []
  18. import json
  19. import logging
  20. import os
  21. import pickle
  22. import time
  23. try:
  24. from config import APP_NAME as ROOT_LOGGER_NAME
  25. except ImportError:
  26. ROOT_LOGGER_NAME = 'root'
  27. logger = logging.getLogger(ROOT_LOGGER_NAME).getChild(__name__)
  28. __DESCRIPTION__ = """The Module {\\tt %s} is designed to store information in {\\tt json} or {\\tt pickle} files to support them much faster then generating them from the original source file.
  29. For more Information read the documentation.""" % __name__.replace('_', '\_')
  30. """The Module Description"""
  31. __INTERPRETER__ = (3, )
  32. """The Tested Interpreter-Versions"""
  33. class property_cache_pickle(object):
  34. """
  35. This class caches the data from a given `source_instance`. It takes the data from the cache instead of generating the data from the `source_instance`,
  36. if the conditions for the cache usage are given.
  37. .. admonition:: Required properties for the `source_instance`
  38. * **uid():** returns the unique id of the source's source or None, if you don't want to use the unique id.
  39. * **keys():** returns a list of all available keys.
  40. * **data_version():** returns a version number of the current data (it should be increased, if the get method of the source instance returns improved values or the data structure had been changed).
  41. * **get(key, default):** returns the property for a key. If key does not exists, default will be returned.
  42. .. hint:: You are able to use all parameters and methods of the `source_instance` identically with the property_cache instance.
  43. :param source_instance: The source instance holding the data
  44. :type source_instance: instance
  45. :param cache_filename: File name, where the properties are stored as cache
  46. :type cache_filename: str
  47. :param load_all_on_init: True will load all data from the source instance, when the cache will be initialised the first time.
  48. :type load_all_on_init: bool
  49. :param callback_on_data_storage: The callback will be executed every time when the cache file is stored. It will be executed with the instance of this class as first argument.
  50. :type callback_on_data_storage: method
  51. :param max_age: The maximum age of the cached data in seconds or None for no maximum age.
  52. :type max_age: int or None
  53. :param store_on_get: False will prevent cache storage with execution of the `.get(key, default)` method. You need to store the cache somewhere else.
  54. :type store_on_get: bool
  55. .. admonition:: The cache will be used, if all following conditions are given
  56. * The key is in the list returned by `.keys()` method of the `source_instance`
  57. * The key is not in the list of keys added by the `.add_source_get_keys()` method.
  58. * The cache age is less then the given max_age parameter or the given max_age is None.
  59. * The uid of the source instance (e.g. a checksum or unique id of the source) is identically to to uid stored in the cache.
  60. * The data version of the `source_instance` is <= the data version stored in the cache.
  61. * The value is available in the previous stored information
  62. **Example:**
  63. .. literalinclude:: caching/_examples_/property_cache_pickle.py
  64. Will result on the first execution to the following output (with a long execution time):
  65. .. literalinclude:: caching/_examples_/property_cache_pickle_1.log
  66. With every following execution the time cosumption my by much smaller:
  67. .. literalinclude:: caching/_examples_/property_cache_pickle_2.log
  68. """
  69. DATA_VERSION_TAG = '_property_cache_data_version_'
  70. STORAGE_VERSION_TAG = '_storage_version_'
  71. UID_TAG = '_property_cache_uid_'
  72. DATA_TAG = '_data_'
  73. AGE_TAG = '_age_'
  74. #
  75. STORAGE_VERSION = 1
  76. def __init__(self, source_instance, cache_filename, load_all_on_init=False, callback_on_data_storage=None, max_age=None, store_on_get=True):
  77. self._source_instance = source_instance
  78. self._cache_filename = cache_filename
  79. self._load_all_on_init = load_all_on_init
  80. self._callback_on_data_storage = callback_on_data_storage
  81. self._max_age = max_age
  82. self._store_on_get = store_on_get
  83. #
  84. self._source_get_keys = []
  85. self._cached_props = None
  86. def add_source_get_keys(self, keys):
  87. """
  88. This will add one or more keys to a list of keys which will always be provided by the `source_instance` instead of the cache.
  89. :param keys: The key or keys to be added
  90. :type keys: list, tuple, str
  91. """
  92. if type(keys) in [list, tuple]:
  93. self._source_get_keys.extend(keys)
  94. else:
  95. self._source_get_keys.append(keys)
  96. def full_update(self):
  97. """
  98. With the execution of this method, the complete source data which needs to be cached, will be read from the source instance
  99. and the resulting cache will be stored to the given file.
  100. .. hint:: Use this method, if you initiallised the class with `store_on_get=False`
  101. """
  102. self._load_source()
  103. self._save_cache()
  104. def get(self, key, default=None):
  105. """
  106. Method to get the cached property. If the key does not exists in the cache or `source_instance`, `default` will be returned.
  107. :param key: key for value to get.
  108. :param default: value to be returned, if key does not exists.
  109. :returns: value for a given key or default value.
  110. """
  111. if key in self.keys() and key not in self._source_get_keys:
  112. if self._cached_props is None:
  113. self._init_cache()
  114. if self._max_age is None:
  115. cache_old = False
  116. else:
  117. cache_old = time.time() - self._cached_props[self.AGE_TAG].get(self._key_filter(key), 0) > self._max_age
  118. if cache_old:
  119. logger.debug("The cached value is old, cached value will be ignored")
  120. if self._key_filter(key) not in self._cached_props[self.DATA_TAG] or cache_old:
  121. logger.debug("Loading property for key='%s' from source instance", key)
  122. val = self._source_instance.get(key, None)
  123. if self._store_on_get:
  124. tm = int(time.time())
  125. logger.debug("Adding key=%s, value=%s with timestamp=%d to chache", key, val, tm)
  126. self._cached_props[self.DATA_TAG][self._key_filter(key)] = val
  127. self._cached_props[self.AGE_TAG][self._key_filter(key)] = tm
  128. self._save_cache()
  129. else:
  130. return val
  131. else:
  132. logger.debug("Providing property for '%s' from cache", key)
  133. return self._cached_props[self.DATA_TAG].get(self._key_filter(key), default)
  134. else:
  135. if key not in self.keys():
  136. logger.debug("Key '%s' is not in cached_keys. Uncached data will be returned.", key)
  137. elif key in self._source_get_keys:
  138. logger.debug("Key '%s' is excluded by .add_source_get_keys(). Uncached data will be returned.", key)
  139. return self._source_instance.get(key, default)
  140. def _data_version(self):
  141. if self._cached_props is None:
  142. return None
  143. else:
  144. return self._cached_props.get(self.DATA_VERSION_TAG, None)
  145. def _storage_version(self):
  146. if self._cached_props is None:
  147. return None
  148. else:
  149. return self._cached_props.get(self.STORAGE_VERSION_TAG, None)
  150. def _init_cache(self):
  151. load_cache = self._load_cache()
  152. uid = self._source_instance.uid() != self._uid()
  153. try:
  154. data_version = self._source_instance.data_version() > self._data_version()
  155. except TypeError:
  156. data_version = True
  157. try:
  158. storage_version = self._storage_version() != self.STORAGE_VERSION
  159. except TypeError:
  160. storage_version = True
  161. #
  162. if not load_cache or uid or data_version or storage_version:
  163. if load_cache:
  164. if self._uid() is not None and uid:
  165. logger.debug("Source uid changed, ignoring previous cache data")
  166. if self._data_version() is not None and data_version:
  167. logger.debug("Data version increased, ignoring previous cache data")
  168. if storage_version:
  169. logger.debug("Storage version changed, ignoring previous cache data")
  170. self._cached_props = {self.AGE_TAG: {}, self.DATA_TAG: {}}
  171. if self._load_all_on_init:
  172. self._load_source()
  173. self._cached_props[self.UID_TAG] = self._source_instance.uid()
  174. self._cached_props[self.DATA_VERSION_TAG] = self._source_instance.data_version()
  175. self._cached_props[self.STORAGE_VERSION_TAG] = self.STORAGE_VERSION
  176. self._save_cache()
  177. def _load_cache(self):
  178. if os.path.exists(self._cache_filename):
  179. with open(self._cache_filename, 'rb') as fh:
  180. self._cached_props = pickle.load(fh)
  181. logger.debug('Loading properties from cache (%s)', self._cache_filename)
  182. return True
  183. else:
  184. logger.debug('Cache file does not exists (yet).')
  185. return False
  186. def _key_filter(self, key):
  187. return key
  188. def _load_source(self):
  189. if self._cached_props is None:
  190. self._init_cache()
  191. logger.debug('Loading all data from source - %s', repr(self.keys()))
  192. for key in self.keys():
  193. if key not in self._source_get_keys:
  194. self._cached_props[self.DATA_TAG][self._key_filter(key)] = self._source_instance.get(key)
  195. self._cached_props[self.AGE_TAG][self._key_filter(key)] = int(time.time())
  196. def _save_cache(self):
  197. with open(self._cache_filename, 'wb') as fh:
  198. pickle.dump(self._cached_props, fh)
  199. logger.debug('cache-file stored (%s)', self._cache_filename)
  200. if self._callback_on_data_storage is not None:
  201. self._callback_on_data_storage(self)
  202. def _uid(self):
  203. if self._cached_props is None:
  204. return None
  205. else:
  206. return self._cached_props.get(self.UID_TAG, None)
  207. def __getattribute__(self, name):
  208. try:
  209. return super().__getattribute__(name)
  210. except AttributeError:
  211. return getattr(self._source_instance, name)
  212. class property_cache_json(property_cache_pickle):
  213. """
  214. See also parent :py:class:`property_cache_pickle` for detailed information.
  215. .. important::
  216. * This class uses json. You should **only** use keys of type string!
  217. * Unicode types are transfered to strings
  218. See limitations of json.
  219. **Example:**
  220. .. literalinclude:: caching/_examples_/property_cache_json.py
  221. Will result on the first execution to the following output (with a long execution time):
  222. .. literalinclude:: caching/_examples_/property_cache_json_1.log
  223. With every following execution the time cosumption my by much smaller:
  224. .. literalinclude:: caching/_examples_/property_cache_json_2.log
  225. """
  226. def _load_cache(self):
  227. if os.path.exists(self._cache_filename):
  228. with open(self._cache_filename, 'r') as fh:
  229. self._cached_props = json.load(fh)
  230. logger.debug('Loading properties from cache (%s)', self._cache_filename)
  231. return True
  232. else:
  233. logger.debug('Cache file does not exists (yet).')
  234. return False
  235. def _save_cache(self):
  236. with open(self._cache_filename, 'w') as fh:
  237. json.dump(self._cached_props, fh, sort_keys=True, indent=4)
  238. logger.debug('cache-file stored (%s)', self._cache_filename)
  239. if self._callback_on_data_storage is not None:
  240. self._callback_on_data_storage(self)