data¶
-
class
zfit.data.
Data
(dataset: Union[tensorflow.python.data.ops.dataset_ops.DatasetV1, LightDataset], obs: Union[str, Iterable[str], zfit.Space] = None, name: str = None, weights=None, iterator_feed_dict: Dict[KT, VT] = None, dtype: tensorflow.python.framework.dtypes.DType = None)[source]¶ Bases:
zfit.util.execution.SessionHolderMixin
,zfit.util.cache.Cachable
,zfit.core.interfaces.ZfitData
,zfit.core.dimension.BaseDimensional
,zfit.core.baseobject.BaseObject
Create a data holder from a dataset used to feed into models.
Parameters: - () (dtype) – A dataset storing the actual values
- () – Observables where the data is defined in
- () – Name of the Data
- () –
- () –
-
add_cache_dependents
(cache_dependents: Union[zfit.core.interfaces.ZfitCachable, Iterable[zfit.core.interfaces.ZfitCachable]], allow_non_cachable: bool = True)¶ Add dependents that render the cache invalid if they change.
Parameters: - cache_dependents (ZfitCachable) –
- allow_non_cachable (bool) – If True, allow cache_dependents to be non-cachables. If False, any cache_dependents that is not a ZfitCachable will raise an error.
Raises: TypeError
– if one of the cache_dependents is not a ZfitCachable _and_ allow_non_cachable if False.
-
axes
¶ Return the axes.
-
convert_sort_space
(obs: Union[str, Iterable[str], zfit.Space] = None, axes: Union[int, Iterable[int]] = None, limits: Union[Tuple[Tuple[Tuple[float, ...]]], Tuple[float, float], bool] = None) → Optional[zfit.core.limits.Space][source]¶ Convert the inputs (using eventually obs, axes) to
Space
and sort them according to own obs.Parameters: - () (limits) –
- () –
- () –
Returns:
-
copy
(deep: bool = False, name: str = None, **overwrite_params) → zfit.core.interfaces.ZfitObject¶
-
data_range
¶
-
dtype
¶
-
classmethod
from_numpy
(obs: Union[str, Iterable[str], zfit.Space], array: numpy.ndarray, weights: Union[tensorflow.python.framework.ops.Tensor, None, numpy.ndarray] = None, name: str = None, dtype: tensorflow.python.framework.dtypes.DType = None)[source]¶ Create Data from a np.array.
Parameters: - () (obs) –
- array (numpy.ndarray) –
- name (str) –
Returns: Return type: zfit.Data
-
classmethod
from_pandas
(df: pandas.core.frame.DataFrame, obs: Union[str, Iterable[str], zfit.Space] = None, weights: Union[tensorflow.python.framework.ops.Tensor, None, numpy.ndarray] = None, name: str = None, dtype: tensorflow.python.framework.dtypes.DType = None)[source]¶ Create a Data from a pandas DataFrame. If obs is None, columns are used as obs.
Parameters:
-
classmethod
from_root
(path: str, treepath: str, branches: List[str] = None, branches_alias: Dict[KT, VT] = None, weights: Union[tensorflow.python.framework.ops.Tensor, None, numpy.ndarray, str] = None, name: str = None, dtype: tensorflow.python.framework.dtypes.DType = None, root_dir_options=None) → zfit.core.data.Data[source]¶ Create a Data from a ROOT file. Arguments are passed to uproot.
Parameters: - path (str) –
- treepath (str) –
- branches (List[str]]) –
- branches_alias (dict) – A mapping from the branches (as keys) to the actual observables (as values). This allows to have different observable names, independent of the branch name in the file.
- weights (tf.Tensor, None, np.ndarray, str]) – Weights of the data. Has to be 1-D and match the shape of the data (nevents). Can be a column of the ROOT file by using a string corresponding to a column.
- name (str) –
- () (root_dir_options) –
Returns: Return type: zfit.Data
-
classmethod
from_root_iter
(path, treepath, branches=None, entrysteps=None, name=None, **kwargs)[source]¶
-
classmethod
from_tensor
(obs: Union[str, Iterable[str], zfit.Space], tensor: tensorflow.python.framework.ops.Tensor, weights: Union[tensorflow.python.framework.ops.Tensor, None, numpy.ndarray] = None, name: str = None, dtype: tensorflow.python.framework.dtypes.DType = None) → zfit.core.data.Data[source]¶ Create a Data from a tf.Tensor. Value simply returns the tensor (in the right order).
Parameters: Returns: Return type: zfit.core.Data
-
iterator
¶
-
n_obs
¶ Return the number of observables.
-
name
¶ The name of the object.
-
nevents
¶
-
obs
¶ Return the observables.
-
register_cacher
(cacher: Union[zfit.core.interfaces.ZfitCachable, Iterable[zfit.core.interfaces.ZfitCachable]])¶ Register a cacher that caches values produces by this instance; a dependent.
Parameters: () (cacher) –
-
reset_cache
(reseter: zfit.util.cache.ZfitCachable)¶
-
reset_cache_self
()¶ Clear the cache of self and all dependent cachers.
-
sess
¶
-
set_sess
(sess: tensorflow.python.client.session.Session)¶ Set the session (temporarily) for this instance. If None, the auto-created default is taken.
Parameters: sess (tf.Session) –
-
set_weights
(weights: Union[tensorflow.python.framework.ops.Tensor, None, numpy.ndarray])[source]¶ Set (temporarily) the weights of the dataset.
Parameters: weights (tf.Tensor, np.ndarray, None) –
-
to_pandas
(obs: Union[str, Iterable[str], zfit.Space] = None)[source]¶ Create a pd.DataFrame from obs as columns and return it.
Parameters: () (obs) – The observables to use as columns. If None, all observables are used. Returns:
-
unstack_x
(obs: Union[str, Iterable[str], zfit.Space] = None, always_list: bool = False)[source]¶ Return the unstacked data: a list of tensors or a single Tensor.
Parameters: - () (obs) – which observables to return
- always_list (bool) – If True, always return a list (also if length 1)
Returns: List(tf.Tensor)
-
weights
¶