Insights

Maven Fuel Oil Inventory Pressure Research Framework (IPRF)

Structural Relationships Between Observable Fuel Oil Inventory Pressure and Forward Singapore HSFO Pricing Behaviour.

Introduction

This paper presents the Maven Fuel Oil Inventory Pressure Research Framework, a systematic research programme designed to test whether observable physical fuel oil inventory pressure exhibits statistically meaningful relationships with future Singapore HSFO crack pricing behaviour.

The framework was developed to evaluate whether cargo-level physical intelligence, regional arrivals data, operational cargo maturity, and inventory accumulation behaviour can be reconstructed into a repeatable research architecture capable of explaining forward pricing structure.

The research does not attempt to present a deterministic trading model. Instead, it functions as a structural market analysis framework. Its purpose is to test whether observable supply pressure contains measurable information about future pricing behaviour, and to identify which definitions of physical inventory pressure are most informative.

The research focused on Singapore HSFO cracks because Singapore remains one of the most important regional pricing and clearing centres for fuel oil and bunker-related supply. The analysis therefore tested whether observable regional fuel oil arrivals, particularly HSFO-related cargoes, could help explain future movements in Singapore HSFO crack structure.

Across the research universe, the strongest observed Spearman relationship reached 0.6155, while the strongest observed Pearson relationship reached 0.5276. The strongest relationships consistently exhibited an inverse relationship between observable inventory pressure and future Singapore HSFO crack levels.

The concentration of strong results around similar parameter combinations suggests that the relationships identified were not isolated statistical outliers but instead reflected a coherent and repeatable market structure.

In practical terms, the results indicate that higher observable inventory pressure was associated with weaker future Singapore HSFO crack structure, while lower observable inventory conditions were associated with stronger future crack behaviour.

Executive Summary

The research programme evaluated 22,572 valid experimental configurations across more than 4.31 million underlying observation rows.

The strongest results were generated by inventory accumulation measures rather than single-week inventory snapshots. Rolling and decayed inventory methodologies consistently outperformed prompt-only inventory signals, with the highest correlations emerging from four-to-six-week inventory accumulation windows.

The top-performing models were highly consistent. The strongest configurations repeatedly combined:

ComponentStrongest Pattern Observed
ProductHSFO-only
Confidence Filterconfidence_clean
ETA Window0 to 4 weeks
Stem Statusactive_stem_only and confirmed_only
Geographycore_singapore_market and extended_bunker_supply_region
Signal Typerolling_6w_inventory_sum and decayed_equal_6w_inventory
Target Typecrack_level
Lag2 weeks forward
DirectionInverse relationship between inventory pressure and crack strength

The best observed configuration produced:

MetricResult
Spearman-0.6155
Pearson-0.5212
Observation Count202
Producthsfo_only
Geographyextended_bunker_supply_region
Confidenceconfidence_clean
Stem Statusactive_stem_only
Signal Typedecayed_equal_6w_inventory
Targettarget_crack_lag_2
Lag2 weeks
ETA Window0 to 4

The best results were economically coherent. They suggest that observable HSFO inventory pressure behaves less like a short-term event signal and more like a broader market-state variable. Persistent inventory accumulation appears to carry substantially more information about future Singapore HSFO crack structure than a single weekly arrivals snapshot.

The top 100 strongest configurations were especially important because they showed concentrated and repeatable structure:

Factor LevelShare of Top 100 Models
confidence_clean100%
hsfo_only100%
ETA window 0 to 4100%
crack_level target100%
lag 274%
lag 326%
active_stem_only43%
confirmed_only43%
confirmed_plus_projected14%

This concentration was observed simultaneously across multiple independent dimensions, including product selection, confidence filtering, ETA window construction, target type, lag structure, and stem status. The clustering of results around the same physical interpretation of the market suggests that the strongest findings are unlikely to be isolated statistical outliers and instead reflect a persistent underlying relationship between observable inventory pressure and forward HSFO pricing structure.

Research Objective

The objective of the framework was to determine whether observable physical fuel oil inventory pressure has a measurable relationship with future Singapore HSFO pricing behaviour.

The framework was designed to answer five core questions:

Research QuestionPurpose
Does observable inventory pressure matter?Tests whether physical arrivals contain pricing information.
Which product definition works best?Tests whether HSFO-only flows outperform broader HSFO-related groupings.
Which cargo certainty level matters most?Tests whether confirmed or high-confidence cargoes produce cleaner signals.
Which timing window matters most?Tests whether prompt and forward arrivals outperform broader historical windows.
Which signal construction method works best?Tests whether rolling, decayed, binary, change, or level signals perform best.

The broader institutional objective was to build a repeatable research system rather than a one-off correlation exercise.

This matters because a single strong correlation can be misleading. A robust research framework should show whether similar relationships persist across different configurations, filters, regions, products, signal types, and lag horizons.

Economic Hypothesis

The core economic hypothesis is that observable physical inventory pressure influences Singapore HSFO crack structure through regional supply balance.

The simplified logic chain is:

  • Physical cargo arrivals influence observable inventory pressure.
  • Observable inventory pressure influences regional supply availability.
  • Regional supply availability influences blending, bunker supply, refinery optimisation, and marginal fuel oil balance.
  • Those physical balance conditions are then reflected in Singapore HSFO crack pricing behaviour.

The expected relationship was inverse.

Higher observable inventory pressure should generally be associated with weaker future HSFO crack structure, because increased supply availability places pressure on prompt and nearby pricing.

Lower observable inventory pressure should generally be associated with stronger future HSFO crack structure, because tighter physical availability supports crack pricing.

The framework also tested whether this relationship appears immediately or with a lag. The strongest results emerging around the two-week forward horizon suggest that physical pressure may require time to propagate through regional balances, trading behaviour, bunker supply chains, blending systems, and benchmark pricing.

Why Physical Fuel Oil Intelligence Matters

Fuel oil markets are shaped by physical balance dynamics that are not always fully visible through price data alone.

Key physical drivers include:

DriverRelevance
Cargo arrivalsDirectly affect regional supply availability.
Stem statusDetermines whether a cargo is projected, active, confirmed, or arrived.
Product classificationDetermines whether cargoes are directly relevant to HSFO pricing.
Destination regionDetermines whether cargoes are connected to the Singapore pricing ecosystem.
Arrival timingDetermines whether supply is prompt, deferred, or already confirmed.
Inventory persistenceDetermines whether supply pressure is isolated or sustained.

Traditional price analysis starts from the price itself. This research starts from observable physical market behaviour and tests whether it can explain future price structure.

That distinction is important. If physical flow intelligence contains useful information before it is fully reflected in benchmark pricing, then cargo-level observability can become a meaningful market-state indicator.

Dataset and Experimental Universe

The research used cargo-level fuel oil intelligence aggregated into weekly inventory-pressure systems.

The framework tested:

MetricResult
Valid experimental results22,572
Total observation rows4,313,381
Average observations per valid result191.1
Best observed absolute Spearman0.6155
Best observed absolute Pearson0.5276
90th percentile absolute Spearman0.2060

The strongest Spearman and Pearson results were not necessarily generated by the same experimental configuration. The values above represent the highest absolute correlations observed across the full experimental universe.

The analysis used valid experiment results only. Results were filtered to remove insufficient observations, constant inputs, and invalid statistical outputs.

The experimental universe included multiple factor families:

FactorLevels Tested
Confidence3
Geography3
Product4
Stem Status5
ETA Window2
Weighting Set5
Signal Type21
Target Type4
Target Feature16
Lag Weeks4
Rolling Window5
Decay Type4

A separate Fuel Oil Inventory Pressure Research Factor Catalogue defines every factor level used in the research universe and can be provided on request.

Data Inputs and Weekly Alignment

The framework used three primary data inputs.

The physical inventory side of the analysis was built from Maven Knowledge cargo-level fuel oil intelligence. This included observable cargo movements, product classifications, stem status, confidence classifications, destination geography, ETA timing, and reported cargo volumes.

The pricing side of the analysis used Singapore HSFO bunker pricing and Brent crude pricing. These two series were combined to construct a Singapore HSFO crack series, allowing the framework to evaluate the relationship between observable physical inventory pressure and relative HSFO pricing strength rather than outright fuel oil prices alone.

Using the crack rather than the outright HSFO price was important because it helps isolate fuel oil-specific supply and demand dynamics from broader movements in the crude oil market.

The research period covered 2022 through 2025. This provided approximately four years of weekly market observations across multiple market regimes, including periods of tighter and weaker fuel oil market structure. A small number of gaps within the weekly data series were filled to preserve continuity and maintain a consistent observation framework throughout the study period.

The analysis was deliberately constructed around a weekly cadence. Maven Knowledge publishes physical fuel oil intelligence on a weekly reporting cycle, so all inventory observations were aligned to Monday report dates. Singapore HSFO bunker prices, Brent prices, and the derived crack series were aligned to the same weekly Monday timestamps.

This produced approximately 200 weekly Maven inventory observations across the research period.

The Monday alignment was important for three reasons.

  • It ensured that inventory and pricing observations were compared on a consistent weekly basis.
  • It avoided mixing daily price volatility with weekly physical reporting data.
  • It means the framework can be applied directly alongside Maven Knowledge’s existing weekly reporting process.

In practical terms, each weekly Maven report can be used to update the observable inventory pressure signal, compare it against the historical framework, and assess whether prevailing physical supply conditions are consistent with stronger or weaker forward Singapore HSFO crack structure.

Methodology

The methodology consisted of five steps.

Step 1: Build Physical Inventory Signals

Cargo-level physical observations were grouped into weekly inventory measures.

Each inventory signal was defined by a combination of:

DimensionExample
Producthsfo_only
Geographycore_singapore_market
Stem Statusconfirmed_only
Confidence Filterconfidence_clean
ETA Window0 to 4
Signal Typerolling_6w_inventory_sum

This created multiple plausible definitions of observable fuel oil inventory pressure.

Step 2: Align Inventory With Pricing

Inventory signals were aligned against forward Singapore HSFO crack targets.

The framework tested crack levels, crack changes, z-scored crack levels, and z-scored crack changes across one-to-four-week forward lags.

Step 3: Calculate Statistical Relationships

The framework calculated both Pearson and Spearman correlations.

MetricPurpose
PearsonMeasures linear relationship strength.
SpearmanMeasures monotonic rank relationship strength.

Spearman was especially important because physical commodity markets often behave non-linearly. A market may respond differently once inventory pressure becomes extreme, persistent, or regime-defining.

Step 4: Rank Configurations

Configurations were initially ranked using relationship strength. Observation counts and consistency across related experiments were then used as secondary validation measures when interpreting the results.

The strongest observed configurations were reviewed alongside broader factor-level performance to avoid over-interpreting isolated outliers.

Step 5: Analyse Factor-Level Structure

The framework then assessed which factors and factor levels appeared most frequently in the strongest models.

This matters because repeatability is more important than a single winning result.

The strongest evidence came from factor levels that combined:

Evidence TypeWhy It Matters
High best SpearmanShows maximum observed explanatory relationship.
High 90th percentile SpearmanShows strength across better-performing configurations.
Frequent top-100 appearanceShows consistency among strongest models.
Economic plausibilityConfirms the result makes market sense.

Executive Results

The top 10 results were tightly clustered and economically consistent.

RankSpearmanPearsonGeographyProductStatusConfidenceSignalTargetLag
1-0.6155-0.5212extended_bunker_supply_regionhsfo_onlyactive_stem_onlyconfidence_cleandecayed_equal_6w_inventorycrack_level2
2-0.6155-0.5212extended_bunker_supply_regionhsfo_onlyconfirmed_onlyconfidence_cleandecayed_equal_6w_inventorycrack_level2
3-0.6154-0.5212extended_bunker_supply_regionhsfo_onlyactive_stem_onlyconfidence_cleanrolling_6w_inventory_sumcrack_level2
4-0.6154-0.5212extended_bunker_supply_regionhsfo_onlyconfirmed_onlyconfidence_cleanrolling_6w_inventory_sumcrack_level2
5-0.6152-0.5203core_singapore_markethsfo_onlyactive_stem_onlyconfidence_cleandecayed_equal_6w_inventorycrack_level2
6-0.6152-0.5203core_singapore_markethsfo_onlyconfirmed_onlyconfidence_cleandecayed_equal_6w_inventorycrack_level2
7-0.6151-0.5203core_singapore_markethsfo_onlyactive_stem_onlyconfidence_cleanrolling_6w_inventory_sumcrack_level2
8-0.6151-0.5203core_singapore_markethsfo_onlyconfirmed_onlyconfidence_cleanrolling_6w_inventory_sumcrack_level2
9-0.6107-0.5184broad_south_east_asiahsfo_onlyactive_stem_onlyconfidence_cleanrolling_6w_inventory_sumcrack_level2
10-0.6107-0.5184broad_south_east_asiahsfo_onlyconfirmed_onlyconfidence_cleanrolling_6w_inventory_sumcrack_level2

The top models show a clear and consistent pattern.

  • They are all HSFO-only.
  • They all use the highest-confidence cargo filter.
  • They all test future crack levels rather than crack changes.
  • They all use a two-week forward pricing horizon.
  • They all rely on six-week inventory accumulation measures, either through rolling inventory construction or equal-weight decayed inventory construction.
  • They all show an inverse relationship between inventory pressure and future crack strength.

Many of the highest-ranked configurations differ only marginally in geography, stem status, or inventory construction methodology, yet produce almost identical correlation values. This suggests the relationship is robust to reasonable variations in model specification rather than being dependent on a single narrowly-defined model structure.

This consistency is one of the most important findings in the research.

Factor Importance Ranking

The framework assessed how much each factor changed performance across its internal factor levels.

This is not presented as an average correlation. Instead, it is used as a diagnostic measure of factor separation: how much difference there was between the strongest and weakest levels within the same factor.

The chart below summarises factor separation across the full experimental universe. Larger separation indicates that the research results were more sensitive to that particular framework choice.

Research Design Factors

RankFactorSeparation ScoreInterpretation
1Signal Type0.1554Signal construction was the most important design choice.
2Stem Status0.0346Cargo operational maturity materially affected signal quality.
3Confidence0.0340Higher-confidence filtering materially improved results.
4Product0.0286Product definition mattered, with HSFO-only dominant in top models.
5Rolling Window0.0265Rolling accumulation window influenced signal strength.
6Decay Type0.0183Equal decay outperformed more complex decay structures.
7ETA Window0.0132Forward prompt windows outperformed wider backward-looking windows.
8Weighting Set0.0087Named weighting sets mattered less than rolling and decayed accumulation systems.
9Geography0.0005Geography mattered less on average, although the top models still clustered around Singapore-linked regions.

Pricing Target Factors

RankFactorSeparation ScoreInterpretation
1Target Feature0.1493Crack level lag targets clearly outperformed crack change targets.
2Target Type0.1264Crack levels were materially stronger than crack changes or z-scored changes.
3Lag Weeks0.0157Two and three-week forward horizons dominated the strongest models.

This ranking is important because it shows that the research result was not driven only by geography or product selection. The biggest design decision was how inventory pressure was constructed.

Rolling and decayed inventory accumulation systems consistently outperformed simple inventory definitions, reinforcing the view that persistent inventory pressure contains more explanatory information than single-period inventory observations.

Confidence Factor Analysis

Why Confidence Was Tested

Cargo-level physical intelligence contains varying degrees of certainty.

Some observations represent confirmed physical movement. Others include uncertainty around destination, product, routing, or voyage outcome.

The research therefore tested whether stricter cargo confidence filtering improves signal quality.

Results

RankConfidence LevelBest SpearmanP90 SpearmanTop 100 Share
1confidence_clean0.61550.2449100%
2navigation_clean0.47730.18440%
3base_clean0.47730.18530%

Interpretation

The confidence_clean filter dominated the strongest results.

It produced the highest observed Spearman relationship, the strongest P90 Spearman result, and appeared in 100% of the top 100 configurations.

This strongly supports the hypothesis that cargo confidence and data quality matter. Removing uncertain or lower-quality cargo observations materially improved the structural relationship between observable inventory pressure and future Singapore HSFO crack behaviour.

Practical Takeaway

For research and monitoring purposes, the highest-confidence cargo universe should be treated as the primary analytical dataset.

Broader datasets may remain useful for market context, but the strongest pricing relationships emerged from cleaner, higher-certainty physical flow data.

Geography Factor Analysis

Why Geography Was Tested

Singapore HSFO pricing may respond to different geographic definitions of supply pressure.

A narrow Singapore-focused region may capture direct local inventory conditions, while a broader South East Asian region may capture regional bunker supply, blending, and redistribution effects.

Results

RankGeographyBest SpearmanP90 SpearmanTop 100 Share
1core_singapore_market0.61520.208135%
2extended_bunker_supply_region0.61550.205537%
3broad_south_east_asia0.61070.204528%

Interpretation

Geography showed less separation than other factors. All three geography definitions produced strong results and all appeared in the top 100 models.

The results should not be interpreted as a single geography dominating the framework. The core_singapore_market ranked highest on the factor-level ranking and produced the strongest P90 Spearman result. The extended_bunker_supply_region produced the single best observed Spearman result and appeared most frequently in the top 100 configurations. The broad_south_east_asia definition also performed strongly, with only a modest drop in best Spearman and P90 Spearman.

This suggests that the strongest inventory relationship is not confined to one narrow geographic definition. Instead, Singapore HSFO pricing appears connected to a broader regional supply ecosystem.

Practical Takeaway

The evidence supports treating Singapore as part of a wider regional fuel oil and bunker supply system rather than as an isolated local inventory point.

For ongoing monitoring, both core Singapore-linked arrivals and broader regional bunker supply should be tracked.

Product Factor Analysis

Why Product Was Tested

The framework tested whether Singapore HSFO pricing is best explained by pure HSFO cargo flows or by broader HSFO-related material, including SRFO and heavy sweet-related cargoes.

Results

Product LevelBest SpearmanP90 SpearmanTop 100 Share
hsfo_only0.61550.2103100%
hsfo_hsweet0.39880.13870%
hsfo_srfo0.38200.20770%
hsfo_srfo_hsweet0.37680.21410%

Interpretation

The product comparison produced one of the clearest findings in the entire research programme.

HSFO-only cargo definitions materially outperformed all broader HSFO-related product groupings, suggesting that pricing relationships were strongest when the inventory signal remained closely aligned with the underlying benchmark market.

HSFO-only appeared in 100% of the top 100 configurations and produced the strongest observed Spearman relationship by a substantial margin.

None of the broader product groupings appeared in the top 100 configurations. While some alternative groupings produced respectable factor-level results, they did not generate the strongest observed relationships.

This is an important finding because it suggests that direct HSFO physical flow observability was more informative than broader HSFO-adjacent product aggregation.

Practical Takeaway

For Singapore HSFO crack analysis, HSFO-only physical inventory pressure should remain the primary signal definition.

Broader product groupings may still provide market context, but they should not replace HSFO-only analysis in the core framework.

Stem Status Factor Analysis

Why Stem Status Was Tested

Cargo status captures the operational maturity of a physical movement.

Projected cargoes may contain forward-looking information, but they also introduce uncertainty. Active and confirmed cargoes are more physically reliable and may therefore produce cleaner pricing relationships.

Results

Stem StatusBest SpearmanP90 SpearmanTop 100 Share
active_stem_only0.61550.233243%
confirmed_only0.61550.233243%
confirmed_plus_projected0.59120.221514%
projected_only0.54050.14930%

A fifth stem-status category, anchored_only, was included within the broader experimental universe but did not generate sufficient valid statistical outputs to form part of the factor-level comparison presented here.

Interpretation

Active and confirmed cargoes clearly dominated the strongest results.

Together, active_stem_only and confirmed_only accounted for 86% of the top 100 configurations while producing the highest observed Spearman relationships and strongest P90 performance.

confirmed_plus_projected also appeared in the top 100, but at a much lower frequency. projected_only did not appear in the top 100 configurations.

This suggests that physical certainty matters. The market relationship was strongest when the inventory signal was built from cargoes with greater operational maturity and higher discharge confidence.

Practical Takeaway

Confirmed and active voyage cargoes should form the core of the inventory pressure framework.

Projected cargoes may be useful for forward market colour, but they should be treated carefully and should not dominate the primary signal.

ETA Window Factor Analysis

Why ETA Window Was Tested

The ETA window determines whether inventory pressure is measured using only prompt and forward arrivals, or whether backward-looking confirmation windows are also included.

Results

RankETA WindowBest SpearmanP90 SpearmanTop 100 Share
10 to 40.61550.2234100%
2-6 to 40.54050.20020%

Interpretation

The 0 to 4 week ETA window dominated.

It appeared in 100% of the top 100 configurations and produced the strongest observed relationship.

The narrower forward-looking window consistently outperformed the broader window that incorporated six weeks of backward-looking arrival confirmation. This suggests that the strongest pricing relationships were associated with prompt and forward observable supply pressure rather than inventory definitions that incorporated older arrival information.

Practical Takeaway

The primary framework should focus on the current week through four weeks forward.

Backward confirmation may still have research value, but the strongest pricing relationships emerged from prompt and forward observable supply pressure.

Weighting Set Factor Analysis

Why Weighting Sets Were Tested

The framework tested named weighting methodologies to determine whether prompt, forward, balanced, or backward-confirmation weighting curves improved signal construction.

These weighting sets were designed to test whether different arrival timing assumptions improved the relationship between inventory pressure and pricing.

Results

RankWeighting SetBest SpearmanP90 SpearmanTop 100 Share
1target_week_only0.54050.24690%
2backward_confirmation0.34600.21970%
3tight_prompt0.38980.20550%
4balanced_curve0.43030.19240%
5forward_heavy0.42980.19310%

Interpretation

The named weighting sets produced useful results, but they did not appear in the top 100 strongest configurations.

This does not mean weighting is irrelevant. Rather, the strongest models came from rolling and decayed inventory accumulation signals that did not rely on these named weighting_set_name fields.

The best named weighting result came from target_week_only, which suggests that when simple weighting systems are used, prompt and target-week supply are more informative than more distributed weighting curves.

Practical Takeaway

Named weighting sets should remain part of the research framework, but the primary signal architecture should prioritise rolling and decayed accumulation systems.

Prompt weighting remains economically intuitive, but persistence-based inventory construction performed better.

Signal Type Factor Analysis

Why Signal Type Was Tested

Signal construction was one of the most important research questions.

The framework tested whether pricing relationships are stronger when inventory pressure is measured as:

Signal FamilyPurpose
Volume levelAbsolute inventory pressure.
Volume changeInventory acceleration or deceleration.
Z-scoreAbnormal inventory pressure versus history.
Binary high inventoryHigh-pressure regime indicator.
Rolling sumPersistent accumulated inventory pressure.
Decayed inventoryTime-weighted accumulated inventory pressure.

Results

Signal TypeBest SpearmanP90 SpearmanTop 100 Share
decayed_equal_6w_inventory0.61550.591015%
rolling_6w_inventory_sum0.61540.590915%
rolling_8w_inventory_sum0.60720.582910%
decayed_equal_8w_inventory0.60720.582910%
decayed_equal_4w_inventory0.60530.583112%
rolling_4w_inventory_sum0.60530.583112%
rolling_3w_inventory_sum0.59630.58198%
decayed_linear_4w_inventory0.58760.56206%
decayed_front_loaded_4w_inventory0.58760.56206%
rolling_2w_inventory_sum0.56380.56110%
volume_zscore0.54050.23730%
volume_level0.53860.25590%
binary_high_inventory0.31840.17570%
volume_change0.18310.11290%

Interpretation

Signal type was the most important design factor in the entire framework.

Rolling and decayed inventory accumulation systems clearly outperformed simple weekly volume level, z-score, binary, and change signals.

The strongest observed signal was decayed_equal_6w_inventory, with a best Spearman of 0.6155. rolling_6w_inventory_sum was effectively identical, with a best Spearman of 0.6154.

The top 100 models were heavily concentrated in rolling and decayed accumulation systems:

Signal TypeTop 100 Share
decayed_equal_6w_inventory15%
rolling_6w_inventory_sum15%
decayed_equal_4w_inventory12%
rolling_4w_inventory_sum12%
decayed_equal_8w_inventory10%
rolling_8w_inventory_sum10%

The strongest results consistently emerged from four-to-eight-week inventory accumulation systems. This strongly supports the conclusion that persistent inventory pressure contains substantially more information than isolated weekly inventory observations.

Practical Takeaway

The primary signal architecture should focus on rolling and decayed inventory accumulation, particularly four-to-six-week systems.

Simple week-on-week inventory change was not a strong signal in this framework.

Rolling Window Factor Analysis

Why Rolling Windows Were Tested

Rolling windows test whether sustained inventory pressure is more important than a single weekly observation.

The framework tested two, three, four, six, and eight-week rolling periods.

Results

Rolling WindowBest SpearmanP90 SpearmanTop 100 Share
2 weeks0.56380.56110%
3 weeks0.59630.58198%
4 weeks0.60530.568936%
6 weeks0.61550.566036%
8 weeks0.60720.558920%

Interpretation

The best single result came from the six-week window, while the top 100 models were dominated by four and six-week windows.

Together, four and six-week rolling windows accounted for 72% of the top 100 model appearances.

The results suggest that inventory pressure becomes more informative when accumulated across multiple weeks rather than measured as a single-period observation. Performance improved materially between the shorter two and three-week windows and the longer four, six, and eight-week windows.

This supports the conclusion that inventory pressure operates as a persistent market-state variable rather than a purely short-term event signal.

Practical Takeaway

Four-to-six-week rolling inventory systems should be prioritised in future research and monitoring.

Eight-week windows remain useful, but may begin to dilute the immediacy of the physical signal.

Decay Type Factor Analysis

Why Decay Type Was Tested

Decay type tests whether recent observations should receive more weight than older observations when constructing accumulated inventory signals.

The framework tested equal, linear, exponential, and front-loaded decay structures.

Results

Decay TypeBest SpearmanP90 SpearmanTop 100 Share
equal0.61550.586237%
linear0.59110.561912%
front_loaded0.58760.54756%
exponential0.58210.54850%

Interpretation

Equal weighting performed best.

It produced the strongest observed Spearman relationship, the highest P90 Spearman value, and the largest share of top 100 model appearances.

This is a notable result. It suggests that, within the tested rolling windows, sustained inventory pressure matters more than aggressively prioritising the most recent week.

In other words, the market relationship appears to be driven by persistent supply pressure accumulated across several weeks rather than only the freshest arrival information.

The fact that more aggressive decay structures failed to outperform equal weighting suggests that inventory persistence may be more important than recency alone when explaining future Singapore HSFO crack behaviour.

Practical Takeaway

Equal-weight rolling and decayed accumulation systems should remain the default for the strongest inventory pressure framework.

More complex decay functions may be useful for future modelling, but they did not outperform equal accumulation in this research set.

Target Type Factor Analysis

Why Target Type Was Tested

The framework tested whether inventory pressure relates more strongly to crack levels, crack changes, standardised crack levels, or standardised crack changes.

Results

RankTarget TypeBest SpearmanP90 SpearmanTop 100 Share
1crack_level0.61550.4190100%
2crack_zscore0.54050.16560%
3crack_change0.19150.11100%
4crack_change_zscore0.18310.10350%

Interpretation

The relationship was strongest against crack levels, not crack changes.

This is an important structural finding. It suggests that observable inventory pressure is more closely associated with the state of the forward crack structure than with short-term weekly price movements.

The results also indicate that level-based targets outperformed both change-based and standardised target definitions. The strongest relationships emerged when inventory pressure was compared directly against the underlying crack structure itself.

That supports the interpretation of inventory pressure as a market-state variable rather than a short-term directional trigger.

Practical Takeaway

The framework should continue to prioritise crack level targets for structural research.

Crack change targets may be more relevant for future trading signal development, but they were not the strongest target type in this phase.

Target Feature and Lag Analysis

Why Lag Was Tested

The framework tested whether inventory pressure relates to future pricing behaviour across one-to-four-week horizons.

The strongest results would indicate where physical supply pressure appears to transmit into pricing structure.

Target Feature Results

Target FeatureBest SpearmanP90 SpearmanTop 100 Share
target_crack_lag_30.59530.524526%
target_crack_lag_20.61550.537674%
target_crack_lag_40.58070.51760%
target_crack_lag_10.51210.29530%

Lag Results

Lag WeeksBest SpearmanP90 SpearmanTop 100 Share
30.59530.207126%
40.58070.21310%
20.61550.219774%
10.54050.18620%

Interpretation

The strongest single results came from the two-week forward crack level target.

The top 100 results were concentrated entirely in two and three-week forward crack level targets, with the two-week horizon appearing in 74% of the top 100 models.

This supports the idea that physical inventory pressure does not always transmit instantly into benchmark pricing. A two-to-three-week window appears to be the strongest observed pricing transmission horizon.

Practical Takeaway

The two-week forward crack level should be treated as the primary target horizon for the current framework.

The three-week horizon is also important and should remain part of monitoring and future development.

Cross-Factor Findings

The top 100 model frequency analysis provides the clearest view of recurring structure.

FactorDominant Top-100 Result
Confidenceconfidence_clean appeared in 100%
Producthsfo_only appeared in 100%
ETA Window0 to 4 appeared in 100%
Target Typecrack_level appeared in 100%
Laglag 2 appeared in 74%, lag 3 in 26%
Stem Statusactive_stem_only and confirmed_only each appeared in 43%
Geographyextended_bunker_supply_region appeared in 37%, core_singapore_market in 35%, broad_south_east_asia in 28%
Signal Typerolling and equal-decayed inventory accumulation dominated
Rolling Window4-week and 6-week windows each appeared in 36%
Decay Typeequal decay appeared most frequently

The same pattern becomes visible when the strongest 100 configurations are analysed collectively. Several framework characteristics repeatedly appeared among the strongest-performing inventory systems.

Most notably, confidence_clean, hsfo_only, the 0 to 4 ETA window, and crack_level targets appeared in 100% of the top 100 configurations. This level of concentration is unusually strong and indicates that the strongest results were clustered around a highly consistent set of design choices.

The most important conclusion is that the strongest models were not scattered randomly across the experimental universe.

They clustered around a coherent structure:

  • High-confidence HSFO-only cargoes.
  • Prompt-to-forward ETA windows.
  • Operationally active or confirmed stems.
  • Rolling or equal-decayed inventory accumulation systems.
  • Future crack levels.
  • Two-to-three-week forward pricing horizons.

This consistency materially increases confidence in the economic interpretation of the results.

Strongest Observed Configuration

The strongest observed configuration was:

FactorLevel
Geographyextended_bunker_supply_region
Producthsfo_only
Stem Statusactive_stem_only
Confidenceconfidence_clean
ETA Window0 to 4
Signal Typedecayed_equal_6w_inventory
Rolling Window6
Decay Typeequal
Target Typecrack_level
Target Featuretarget_crack_lag_2
Lag Weeks2

This configuration produced:

MetricResult
Spearman-0.6155
Pearson-0.5212
Observation Count202
Combined Score0.5684

The direction of the relationship was negative.

The strongest observed configuration is illustrated below. The relationship demonstrates how sustained observable HSFO inventory pressure was associated with weaker future Singapore HSFO crack levels across the research period.

Higher observable HSFO inventory pressure was associated with weaker future Singapore HSFO crack levels.

Best Average Factor-Level Profile

The strongest observed model is not the same as the best average factor-level profile.

The strongest observed model identifies the single highest-performing configuration across the entire experimental universe. By contrast, the best average factor-level profile identifies which factor levels generally performed best within their own factor group when assessed across all experiments.

FactorBest Average Level
Confidenceconfidence_clean
Geographycore_singapore_market
Producthsfo_only
Stem Statusactive_stem_only and confirmed_only
ETA Window0 to 4
Signal Typerolling_3w_inventory_sum
Target Typecrack_level
Lag Weeks3
Rolling Window3
Decay Typeequal
Weighting Settarget_week_only

This table should not be interpreted as a single model specification. Instead, it identifies the strongest-performing factor level within each individual factor category when evaluated on a factor-level basis.

As a result, some entries differ from the strongest observed configuration. This is expected. The profile reflects average factor-level performance rather than the highest-performing combination of factors.

The strongest actual observed model remains the decayed_equal_6w_inventory configuration against two-week forward crack levels.

Structural Interpretation

The research supports several broader conclusions.

Observable Inventory Pressure Matters

The strongest configurations demonstrate a meaningful relationship between observable HSFO inventory pressure and future Singapore HSFO crack structure.

The best observed Spearman of -0.6155 and Pearson of -0.5212 are strong for a physical commodity market framework built from observable cargo-level data.

Persistence Matters More Than Isolated Events

Rolling and decayed inventory accumulation systems dominated the strongest results.

This suggests that persistent inventory pressure is more informative than a single weekly cargo snapshot.

HSFO-Specific Flows Matter Most

HSFO-only appeared in 100% of the top 100 models.

This indicates that direct product relevance matters. Broader HSFO-adjacent product groupings did not improve the strongest signal.

Cargo Confidence Matters

confidence_clean appeared in 100% of the top 100 models.

This is one of the clearest findings in the study. Higher-quality cargo filtering materially improved the framework.

Operational Maturity Matters

Active and confirmed cargoes dominated the strongest stem status results.

This suggests that the market relationship strengthens when inventory pressure is built from physically mature cargoes rather than lower-certainty projections alone.

Crack Levels Matter More Than Crack Changes

The framework performed much better against future crack levels than against weekly crack changes.

This supports the interpretation that inventory pressure explains structural market state rather than short-term price noise.

Pricing Transmission Appears Delayed

The strongest results appeared at the two-week forward horizon, with three-week targets also represented in the top 100.

This suggests that observable physical pressure may take time to transmit through regional fuel oil balances and benchmark pricing.

Institutional Relevance

The framework may be relevant for:

ParticipantPotential Use
Commodity hedge fundsPhysical-to-paper market structure research.
Fuel oil trading desksInventory regime monitoring and forward crack context.
Bunker market participantsRegional supply pressure assessment.
Refinery optimisation teamsFuel oil balance and margin context.
Market analystsPhysical intelligence integration into pricing frameworks.
Risk teamsInventory-state awareness and market regime interpretation.

The framework should not be viewed as a standalone trading signal.

Its value lies in structural interpretation. It helps explain whether observable physical supply conditions are consistent with stronger or weaker forward pricing structure.

Limitations

The research is intentionally correlation-based.

Correlation does not prove causation.

The framework measures observable cargo-level inventory pressure, not complete tank inventory.

Some physical flows may be missed, misclassified, delayed, rerouted, blended, or commercially reallocated.

The pricing relationship may change across market regimes.

The research period may include specific structural conditions that do not repeat exactly in future periods.

Some experimental configurations were intentionally broad or exploratory and were not expected to perform strongly.

The results should therefore be interpreted as evidence of structural relationship, not as a deterministic forecasting model.

Future Research

Future development should focus on several areas.

Regime Segmentation

The framework should be tested across distinct market regimes, including tight markets, oversupplied markets, high-volatility periods, and low-volatility periods.

Rolling Stability

The strongest relationships should be assessed through time to determine whether they remain stable or cluster around specific subperiods.

Non-Linear Modelling

Because Spearman relationships were often stronger than Pearson relationships, future work should explore non-linear and regime-based models.

Multi-Factor Signal Construction

Future modelling may combine inventory pressure with other physical and market variables, including bunker demand, refinery margins, arbitrage economics, freight, and forward curve structure.

Predictive Engine Development

The current framework is a structural research engine. A future phase could convert the strongest configurations into a monitored prediction or regime-classification system.

Cross-Product Expansion

The framework could be extended to LSFO, VLSFO, SRFO, VGO, and other residual or blending-related product groups.

Conclusion

The Maven Fuel Oil Inventory Pressure Research Framework demonstrates that observable physical fuel oil inventory pressure exhibits meaningful and economically coherent relationships with future Singapore HSFO crack pricing behaviour.

The strongest observed configuration produced a Spearman correlation of 0.6155 and a Pearson correlation of 0.5212.

The strongest configurations were consistent, not random. They repeatedly relied on HSFO-only cargoes, high-confidence filtering, active or confirmed stems, prompt-to-forward ETA windows, rolling or decayed inventory accumulation, and two-week forward crack level targets.

Four framework characteristics appeared in 100% of the strongest 100 configurations: confidence_clean, hsfo_only, the 0–4 week ETA window, and crack_level targets. This concentration provides additional evidence that the strongest results were clustered around a highly coherent and repeatable market structure.

The framework was designed to answer five core research questions.

Does observable inventory pressure matter?

Yes. The strongest configurations consistently demonstrated meaningful inverse relationships between observable HSFO inventory pressure and future Singapore HSFO crack behaviour.

Which product definition works best?

HSFO-only cargo definitions performed best. HSFO-only appeared in 100% of the strongest 100 configurations and materially outperformed broader HSFO-related product groupings.

Which cargo certainty level matters most?

High-confidence and operationally mature cargoes performed best. Confidence-clean filtering appeared in 100% of the strongest 100 configurations, while active-stem and confirmed-only frameworks consistently outperformed broader cargo sets.

Which timing window matters most?

Prompt and near-forward arrivals performed best. The 0–4 week ETA window appeared in 100% of the strongest 100 configurations and consistently outperformed broader timing definitions.

Which signal construction method works best?

Rolling and decayed inventory accumulation systems produced the strongest relationships. Persistent inventory pressure proved more informative than isolated weekly cargo movements.

The central conclusion is that observable HSFO inventory pressure behaves less like a short-term event signal and more like a persistent physical market-state variable.

Higher observable inventory pressure was associated with weaker future Singapore HSFO crack levels.

Lower observable inventory pressure was associated with stronger future crack structure.

The research supports the institutional case for incorporating systematic physical cargo intelligence into fuel oil market analysis. It also provides a repeatable framework for monitoring regional inventory pressure, assessing forward crack structure, and developing future physical-to-paper pricing models.

Full experimental outputs, factor definitions, and statistical result sets are available for institutional review.

For additional information, institutional discussions, collaboration enquiries, or trial access to Maven Knowledge physical fuel oil intelligence datasets and reporting, please contact Maven Knowledge.