Impacts and Implications: The Future of Sustainability Reporting and Assessment
Master in European Studies 2013/2014
Thesis Advisor: Professor Andrea Renda
Krystal Crumpler
November 15, 2014
2
TABLE OF CONTENTS INTRODUCTION 4 CHAPTER 1: THE EMERGENCE OF TRANSANTIONAL REGULATORY SCHEMES 8
GAPS IN THE REGULATORY PROCESS 10
A CROSS-‐POLINATION OF AGENDAS 12
TYPOLOGIES AND PROLIFERATION OF TPR 13
PLAYERS AND THE PLURALITY OF INTERESTS 14
THE SUBJECT OF REGULATION AND QUESTION OF LEGITIMACY 16
HOW PRIVATE REGULATION OPERATES 17
THE USE AND ABUSE OF THE SUPPLY CHAIN 18
TPR IN SEARCH OF A DEFINITION 20
HOW GOVERNANCE CAN EFFECT EFFECTIVENSS? 22
WHAT TO MAKE OF EFFECTIVNESS 25
THE (MIS)USE OF INDICATORS 27
RE-‐PIECING THE PUZZLE: COMPLIANCE, EFFECTIVNESS, AND THE LEARNING CURVE
28
THE FIVE Ws OF TPR 31
HOLES IN THE SYSTEM 33
CHAPTER 2: SUSTAINABILITY IN SEARCH OF A DEFINITION 35
RECONSIDERING SUSTAINABILITY VIS-‐À-‐VIS SAFA 36
SUSTAINABLE DEVELOPMENT: WHAT LIES AT STAKE? 37
WHO USES SAFA? 39
DEFINING BOUNDARIES: THE SCOPE OF SAFA 40
SAFA STEP-‐BY-‐STEP 42
STEP #1: MAPPING MATERIAL MATTERS 43
PROCEDURAL ISSUE: HOW TO MAP SMALL-‐SCALE PRODUCERS? 44
STEP #2: CONTEXTUALIZING DATA AND OUR READING OF IT 46
STEP #3: SELECTING APPROPRIATE TOOLS AND INDICATORS 47
3
SUSTAINABILITY PERFORMANCE 49
STEP #4: REPORTING PERFORMANCE IN A “SNAPSHOT” 52
CHAPTER #3: TESTING SAFA VIS-‐À-‐VIS THE PILOT PHASE 54
SAFA “LIVE” FROM THEORY TO PRACTICE 55
SAFA “HOT-‐SPOT” #1: INEFFECTIVE COMPLIANCE CHECKS 56
SAFA “HOT-‐SPOT” #2: INDICATOR CHALLENGES 59
SETTING BOUNDARIES: SPATIAL AND TEMPORAL SPHERES OF INFLUENCE 59
BETWEEN FLEXIBILITY AND RIGIDITY: INDICATORS IN SEARCH OF A THRESHOLD
62
BLURRED THRESHOLDS: WHAT TO DO WITH THE SPACES-‐IN-‐BETWEEN 63
INDICATOR TYPES 65
SAFA “HOT-‐SPOT” #3: (IN)APPROPERIATE METRICS 66
SAFA “HOT-‐SPOT” #4: DATA COLLECTION 69
SAFA “HOT-‐SPOT” #5: IMPLICATIONS FOR SMALL-‐HOLDERS 72
SAFA “HOT-‐SPOT” #6: FUTURE UTILITY? 74
CONCLUSION 77
BIBLIOGRAPHY 79
4
INTRODUCTION In a world of transgressed borders and blurred boundaries, the notion of
globalization evidences the permeation between spheres of the public and the
private and the divide between the national and the international, leading to a sense
of transnationalism—a space and place of transgression(s) between self and other;
the individual and the collective; the national and the international. In a globalized
world comprising cross-‐border flows of products, services, people, and ideas, the
space of the global market becomes an ever-‐more complicated one insofar as its
boundaries are perpetually expanding and its regulation exponentially complicated.
In such a way, governance transcends domestic borders so-‐to-‐speak, escaping
national jurisdiction and calling upon transnational regulation as a means of “making
sense” of these transgressions. In response to the functional and organizational
failures of conventional regulatory bodies (i.e. the weakness of the state, the
fragmentation of international law, or the lack of enforcement mechanisms), the
emergence of transnational private regulation (TPR) represents a new body of rules,
practices, and processes, agreed upon by private actors, firms, NGOs, and experts,
which exercise regulatory power autonomously or by virtue of delegation from
international law or national legislation.
In Chapter One, we shall explore both the emergence and evolution of transnational
private regulation, as an intersection between the public/private and
national/international divide, presenting an opportunity for cooperation among
actors rather than competition between them in the context of transnational
regulatory regimes. In such a way, the nature of global value chains – and the use of
supply chains as instruments for trans-‐national cooperation – represents a radical
new space for transnational regimes to revise the regulatory process. As we shall
discuss throughout the paper, the use of the supply chain as both the site of and
instrument for transnational regulation represents an innovative approach in
effecting effective regulation across multiple forms of governance and among
regulatory relationships. Thus, the use of the supply chain marks a comprehensive
5
approach of transnational private regulation, speaking to its regulatory potential in
the realms of food security, environmental safety, and human rights protection, to
name a few.
Consequently, common frameworks have been established to promote cooperation
among global actors, such as the GlobalGap for food certification schemes, the
Global Reporting Initiative (GRI) for non-‐financial corporate social responsibility
reporting, or the ISEAL Impacts Code for good practices in setting social and
environmental sustainability standards. However, in consideration of the lack of a
hegemonic, international regulator, one may question the legitimacy and
effectiveness of TPR in such absence; that is, the absence of a transnational
arbitrator enforcing and monitoring the implementation and compliance of given
regulatory schemes. In the analysis, then, of the interdependent links between
regulatory relationships and forms of governance, the argument presented herein
calls upon the need for objective evaluation mechanisms, sound ex ante standard-‐
setting and ex post indicator-‐selection, and comprehensive impact assessments in
order to evaluate the effectiveness of TPR. That is, we shall inquire upon the effects
of governance and, successively, how governance can effect effectiveness.
As the proliferation of transnational regulatory schemes continually expands, the
number of environmental, social, and labor codes continually multiply; Koritza
reported in 2013 on the emergence of “several million certified farm and tens of
billions of annual sales of coffee, tea, cocoa, sugar, fruit and other commodities that
meet Fairtrade, Rainforest Alliance or UTZ Certified standards.” 1 Over 20,000
different products bear the blue Marine Stewardship Council (MSC) seal, and 180
million hectares of forest meet the Forest Stewardship Council (FSC) seal. As 106
countries have established national sustainable development strategies, 120
voluntary sustainability standards, eco-‐labels, codes of conduct, and auditor
protocols have emerged, along with 300 different industry codes for social and
1 Koritza, L. 2013, Certification showing it’s worth the investment, Sustainable Brands. (available at http://www.sustainablebrands.com).
6
environmental practices have been developed.2 Despite, however, the increased
number of standards and certification schemes of sustainable performance, the
notion of “sustainability,” in and of itself, becomes ever more evanescent, slipping
between multifaceted forms of its constitution. Hence, the present paper calls upon
the need for an integrated approach in the assessment of regulatory effectiveness.
Considering the intricacy (or, perhaps, impossibility) of compounding multiple
frameworks into a single, common language, lens, and set of rules, we shall analyze
the efforts set for the by the FAO, which seeks to simultaneously simplify
sustainability assessment while defining a new, encompassing framework for it,
establishing a comprehensive, step-‐by-‐step tool for assessing 4 crucial dimensions of
sustainability – good governance, environmental integrity, economic resilience, and
social well-‐being – along food and agricultural value chains.3 In Chapter 2, we shall
present an argument for the merit of the SAFA tool in the field of sustainability
assessment and reporting, as well as its implications for the future. With that in
mind, we shall stress how one must continually call into question what lies at stake
in measuring that which we cannot define; as the definition of “sustainability”
forever escapes the definer, a similar dynamic consequently takes place in the
measurement of it. That is, despite the number of mechanisms, tools, and metrics
for sustainable development perpetually have increased over time, sustainability per
se continuously diminishes.
Finally, in Chapter 3, we shall analyze the SAFA pilot studies performed in 2012 prior
to the finalization of the current SAFA Guidelines of 2013 by 19 small, medium, and
large enterprises in 23 countries and across a variety of agricultural and food sectors,
ranging from tobacco production in Tanzania to organic fruit farmers in Peru. In the
analysis of the SAFA pilots, the paper attempts to critically assess the proposed
2 Cafaggi, F. and A. Renda. 2014. Measuring the effectiveness of private regulatory organizations, forthcoming, p. 8. 3 FAO. 2013. Sustainability Assessment Food and Agricultural Systems Guidelines. Food and Agriculture Organization of the United Nations, Rome (available at http://www.fao.org/fileadmin/templates/nr/sustainability_pathways/docs/SAFA_Guidelines_Final_122013.pdf
7
themes, sub-‐themes, and indicators for sustainability, evaluate the entities’
compliance with the guidelines for self-‐assessment, identify the challenges
encountered within the pilot phase and the major shortcomings of the tool, and
analyze how (and to what degree) the organization incorporated the suggested
measures of improvement within the current version of SAFA available today.
Considering, then, both the merits and limitations of the tool – a tool designed for
effective sustainability assessment – the following discussion seeks to assess the
effectiveness of the tool itself, presenting a qualitative analysis of the “gaps” found
within its regulatory scheme and questioning if and how the new version/vision of
the SAFA framework resolves such shortcomings, serving as a case study by which
the context of transnationalism enables us to interrogate the future of sustainability
assessment—and its particular stake in social progress.
8
CHAPTER 1: THE EMERGENCE OF TRANSANTIONAL REGULATORY SCHEMES
As our global economy becomes ever so more complex, the diversity of our
regulatory schemes, too, must adopt to the conflict, confluence, and
interdependence presented by the dynamic between multiple regulatory processes
and forms of governance. The consequential shift of the “regulatory space” from
public to private spheres and its transition in scope from a national to transnational
one—as well as the interplay between the two extremes—marks a radical revision of
the conventional regulatory space. Instead, this very interplay between the
public/private and domestic/international divide constitutes the emergence of a
new form of regulation, which aims to address the possibilities-‐in-‐between such
regulatory space; that is, transnational private regulation (TPR). In response to
functional and organizational failures of conventional regulatory bodies (i.e. the
weakness of the state, the fragmentation of international law, or the lack of
enforcement mechanisms), transnational private regulation represents a new body
of rules, practices, and processes, agreed upon by private actors, firms, NGOs, and
experts, which exercise regulatory power autonomously or by virtue of delegation
from international law or national legislation.4 In such a way, TPR redistributes
regulatory power across the domestic and global spheres, as well as between public
and private regulators, revealing the role of “actors” and their impacts in the
transnational arena.
Considering the exponentially expanding combination of variables that contribute to
the emergence of private regulation, as Cafaggi and Renda argue, “the regulatory
schemes are not created equal,” in which differentiating factors for any given set of
schemes include, “their origin, the intended beneficiaries, the extent to which these
schemes compete with other similar schemes, and the regulatory functions they
perform” (8). That said, this report aims to underline the importance, thus, of what
lies at stake in such variability, calling upon the need for objective evaluation
mechanisms, sound ex ante standard-‐setting and ex post indicator-‐selection, and
comprehensive impact assessment in order to evaluate effectiveness. Although such
4 Cafaggi, F. and A. Renda, p. 7.
9
metrics could be applied to any form of management, we shall analyze the
interdependent links between governance and regulatory performance by virtue of
the 5 characteristics set forth by Fabrizio Cafaggi and Andrea Renda’s joint report,
Measuring the effectiveness of private regulatory organizations, stipulating the
necessity of (i) a “wider and more sophisticated evaluation” method, (ii) the analysis
of the regulatory process in its entirety, (iii) the distinction between single and
multiple regulatory schemes, (iv) the determination of the governance, standard-‐
setting, monitoring, and enforcement indicators for each regulatory scheme, and (v)
the analysis of effectiveness regarding the cooperation between and among private
and public regulators.5
The very trans-‐national nature of TPR continuously undermines scholars’ ability to
identify and, thus, define it, as private regulation perpetually slips into new
regulatory space(s). Unlike most regulatory systems, transnational private regulation
does not adhere to any common set of principles, as—despite its universal scope—it
lacks any universally agreed-‐upon definition in jurisdictional and academic literature.
Insofar as not a single type of private regulatory scheme exists, the term is rendered
negotiable (e.g. private governance, self-‐regulation, co-‐regulation). However, in
order to contextualize the term within the scope of this paper, Buthe’s broad
definition echoes the encompassing approach found within the present paper, in
which private regulation “entails private actors playing a major role – at one or more
stages beyond implementation or compliance – in what might be called the
‘regulatory process’.”6 In such a way, assessing the effectiveness of transnational
private regulation requires the analysis of the entire regulatory process—and the
sustainability of that process.
5 idem, p.10. 6 idem, p. 17.
10
GAPS IN THE REGULATORY PROCESS
The evolution of TPR can primarily be linked to the weakness of the state as a global
regulator.7 Such failures sparked the emergence of international institutions in the
first part of the 20th century followed by the emergence of transnational private
regulators in the latter. The limitations of the state can be located in the lack of
cross-‐border effectiveness and the inability to both ensure compliance and enact
enforcement. In order to achieve an effective cross-‐border implementation
apparatus, cooperation among states is necessary, resulting however in high
transaction costs. As a result, diverging implementation coverage and capacity
decentralizes the regulatory process, fragmenting its effectiveness. Such
ineffectiveness – and the resulting gaps in the regulatory process – elicits the
creation of private schemes. As Cafaggi argues, these transformations brought about
by the new private regulatory regimes have also modified the unit of analysis,
shifting from a regulatory state to “regulatory capitalism.”8
The issue of globalization underscores the various factors contributing to the shift
towards private regulatory schemes, such as the emergence of global markets and
trade insofar as goods and services cross state boundaries and, hence, cannot be
monitored by single national legislation. The resulting “externalities” of global trade
and governance transcend domestic borders so-‐to-‐speak, escaping national
jurisdiction and calling upon transnational regulation. As Cafaggi and Renda point
out, international regulatory co-‐operation becomes considerably important in
regards to “global public goods” (e.g. deforestation, emission reduction, food safety,
protection of biodiversity) in order to avoid the tendency of “race to the bottom”
market trends, which implies low market prices due to the absence of quality
control, ultimately undermining the aim of regulation itself. Hence, common
frameworks have been established to promote cooperation among global actors
7 Cafaggi, F. 2011a. New Foundations of Transnational Private Regulation,” Vol. 38, “Journal of Law and Society,” EUI WP, p. 3. 8 idem.
11
(e.g. UN Global Compact, FAO SAFA Guidelines, and the ISEAL Alliance) in the form of
transnational private regulation or “meta-‐regulation.”9
Lastly, the surge of informational asymmetries as new technologies require expert
regulatory bodies, the velocity of market transformations in knowledge-‐intensive
and high-‐tech fields calls upon private parties for fast-‐paced and able-‐bodied
regulation, and the role of global value chains all transcend conventional regulatory
spaces, calling upon TPR to regulate in the space beyond the nation. The nature of
global value chains – and the use of supply chains as instruments for trans-‐national
cooperation – represents a radical new space for transnational regimes to revise the
regulatory process. As we shall discuss throughout the paper, the use of the supply
chain as both the site of and instrument for transnational regulation represents an
innovate approach to effecting effective regulation across multiple forms of
governance and among regulatory relationships. Thus, the use of the supply chain
marks a comprehensive approach to transnational private regulation, speaking to its
regulatory potential in areas of food security, environmental safety, and human
rights protection. In such a way, the increasing role of private actors in both
domestic and transnational levels asks us to “reconsider”10 the functions and
boundaries between private and public spheres, subverting the notion of the
private/public divide as mere alternatives and, instead, suggesting their mutual
inter-‐dependence. Such factors indicate that transnational private regimes house the
resources and competences, which can be aligned with the traditional forms of
international organizations and innovative forms of transnational grids in order to
facilitate cooperation among them rather than competition between them. As
Cafaggi and Renda argue, “trans-‐national private regulation is not an alternative but
rather a complement to international public regulation” (14).
Consolidating issues of fragmentation due to the multiplicity of private regimes or
conflicting domestic public legislation, TPR attempts to harmonize these differences.
Although transnational regimes seek to overcome barriers presented by cross-‐
9 Cafaggi, F. and A. Renda, p. 12. 10 Cafaggi, p. 3.
12
border elements and the public/private divide (e.g. food safety and environmental
protection), Cafaggi notes the recent proliferation of TPR at a global level, suggesting
how the multiplication of private regimes (and the competition between them) may
result in even further fragmentation—global fragmentation.11 Hence, such
proliferation further complicates the harmonization efforts of transnational regimes
towards common objectives, ultimately questioning the “effectiveness” of, for
example, standardization in the regulation of public goods and, thus, invoking us to
call into question the implication(s) of fragmentation.
A CROSS-‐POLINATION OF AGENDAS
As private actors often express diverse views and interests, such results in the
emergence of different organizational typologies of regulatory schemes in function
of the relationship between the regulated and the beneficiaries of a given scheme.
The creation of either single stakeholder organizations (e.g. NGOs, industries) or
multi-‐stakeholder organizations (e.g. multiple actors or members) depends upon the
aims of the regulatory body and the concentration of its power.12 Consequently, as
Cafaggi and Renda point out, a regulated firm has a strong interest/involvement in
the regulatory framework as a means of locating its “voice” within the regulatory
structure (such as industry-‐driven interest groups). Similarly, NGOs attempt to
achieve their objectives (such as the protection from gender inequality) by locating
itself within the regulatory process as well. Situating themselves either “within” or
“outside,” actors seek to actualize their goals by virtue of legal or non-‐legal
instruments.13 This, in turns, impacts effectiveness, marking actor involvement as a
key factor of analysis in the evaluation of regime effectiveness.
However, the conflicting interests and differing policies between industries and
NGOs becomes the very point of their conjunction, in which the two conventionally
contrary bodies tend to reconcile their agendas – by virtue of the underlying TPR
conditions – in forms of cooperation rather than competition. This enacts a sort of
11 Cafaggi, p. 4. 12 Cafaggi, F. and A. Renda, p. 15. 13 idem.
13
“cross-‐over effect” between typically rivaling bodies (industry vs. public interest),
which allows for firms to become more involved (and, therefore, accountable) in
private environmental and social regulation while NGOs can cooperate with
industries in order to leverage regulatory impact in areas generally outside of their
reach. By subverting points of conflict into levers of coordination, transnational
private regulation functions as a “cross-‐pollinating” mechanism between divergent
actor agendas and across different forms of governance.
TYPOLOGIES AND PROLIFERATION OF TPR
The agenda of a private regulatory scheme is directly linked to the nature of its
creation; that is, the factors that led to its creation. Generally, three different types
of TPR can be observed. Firstly, the so-‐called “spontaneous private regulatory
schemes”14 may result in the absence of public or private regulatory enforcement,
the fragmentation of public regulation, or in response to other forms of private
regulation. Insofar as multiple private schemes can co-‐exist, the “spontaneous”
nature of one may act in competition or cooperation with another, depending on its
agenda. In such case, the private scheme is a self-‐regulatory one—directly
unaccountable to public regulators and bound only to national or international legal
rules. Secondly, a “co-‐regulatory scheme”15 constitutes the informal engagement
among public regulators in partnership with private actors in order to regulate
specific functions (i.e. rule-‐setting, implementation, monitoring, enforcement, or all
processes). Lastly, a formalized mandate of governance and regulatory requirements
represents that of “formal delegation,” commonly found in the field of technical
standardization. In such a way, the typology of a given private scheme reflects the
issue at hand (e.g. the failure of public regulation in climate change, the fragmented
regulatory schemes for food safety in the private sphere, and high transaction costs
leading to market failure).16
14 idem, p. 47. 15 idem. 16 idem, p. 49.
14
PLAYERS AND THE PLURALITY OF INTERESTS
The variety of organizational frameworks among private schemes is further
complicated by the relationship among actors within those schemes. The
relationship, then, among the regulators, regulated, and beneficiaries determines
the resulting form of governance. As Cafaggi and Renda describe, a variety of
combinations can arise, pointing out three hypotheses. Firstly, the “regulated” can
be located inside the scheme whereas the “beneficiaries” are located outside. As we
shall discuss later, such arrangement of actors mirrors the framework of corporate
social responsibility schemes, allowing for commitment by members of the scheme
and, therefore, benefiting society through strengthened sustainability.17 Reflecting
this scheme, certification standards for food safety and public good regulation in the
protection of human rights both exemplify the same actor relationship. Secondly,
the “regulated” may be outside the scheme while the “beneficiaries” can be found
inside, such as an NGO setting environmental certification schemes (e.g. ISEAL
Alliance). Thirdly, both the “regulated” and “beneficiaries” can act as participants
within the scheme, such as those multi-‐stake holders whom include both suppliers
and retailers within the regulatory scheme (e.g. Global GAP).18
Cafaggi further addresses in his working paper, New Foundations of Transnational
Private Regulation, the complexity and heterogeneity of the private sphere:
Some are mainly driven by industries; some are promoted by NGOs; others by joint endeavor of industry agreements. While at first sight they are all governed by private actors, they pursue different objectives and incorporate multiple degrees of public interest, depending on the composition of their respective governance bodies and effects they have on the general public. (8)
Consequently, the varying (and conflicting) interests underlying TPR schemes speaks
to the notion of a “plurality of interests,” which leads to diverging regulatory
strategies vis-‐à-‐vis the regulatory relationships and choice of governance models.
Thus, Cafaggi calls for a “disentangling” of such plurality as a means of identifying
the underlying dynamic among actors, governance models, and enforcement 17 idem, p. 51. 18 idem, p. 52.
15
mechanisms in order to question accountability—and, consequently, sustainability.
In such a way, the very composition of the regulatory relationship plays a role in the
effectiveness of the private scheme. Cafaggi argues that the role of the
“beneficiaries” within the regulatory process should be examined in terms of
compliance and enforcement as an index of effectiveness. That is, a beneficiary-‐
inclusive arrangement “redefines the nature of responsiveness and the means
through which effectiveness of the regulation should be measured” (9). Specifically,
effectiveness serves not only to measure the compliance of the regulator, but more
importantly, to assess the final impact of the regulatory process on the
beneficiaries.19
In such a way, the multi-‐stakeholder model exemplifies the interplay between
contrary interests and how they are represented in the transnational private
regulatory framework, particularly in regards to the role of the beneficiary, allowing
for a cross-‐sectoral analysis. The “contractual” nature of the multi-‐stakeholder
schemes functions through a system of regulatory contracts (multilateral, network,
or master agreements) placed along the supply chain as a means of regulating the
entire regulatory relationship and process (i.e. from the retailer to the supplier).20
The network created among retailers, suppliers, beneficiaries, and regulators
represents an inter-‐connected formulation of regulatory management, in which all
players contribute – and are therefore held accountable – to the legitimacy of the
scheme. The importance, then, of analyzing who is the beneficiary and the where the
beneficiary is located vis-‐à-‐vis the regulatory framework (whether “inside” or
“outside” the regulatory space) points to the complexity of private regulation,
ultimately calling into question notions of legitimacy, as interests can be (mis)aligned
along the supply chain. Such questions of legitimacy within contractual multi-‐stake
holders are exemplified in both the fields of corporate social responsibility and food
safety—the (il)legitimacy along the “supply chain” can be questioned by means of
dissecting the “chain” into its regulatory “parts.”
19 Cafagggi, p. 9. 20 idem, p. 13.
16
THE SUBJECT OF REGULATION AND QUESTION OF LEGITIMACY
As the combination of actors varies in private regulation, so does that of the subject
of regulation in consideration of its sector-‐specificity. Variables of composition,
scope, geographical influence, and structures of governance condition the specific
regulatory function and objectives of a TPR scheme, ranging from financial markets,
data protection, e-‐commerce, food safety, environmental protection, and technical
standards.21 Although the scope of regulatory regimes is defined by difference,
Cafaggi and Renda argue, “there are paths of policy integration related to regimes
that used to be separate and pursue conflicting objectives” (52). That is, the
permeation between the public/private and domestic/international divide by trans-‐
national private regulatory regimes creates a new interface for compromise between
conventionally contrasting fields of interests (e.g. environmental protection and
social standards, or e-‐commerce and data privacy). The “integrated” approach
towards regulation becomes one of cooperation, juxtaposing the regulatory
processes of environmental risk management and social standardization within a
single regime in order to promote harmonization in all steps of the regulatory
process—and not purely in the name of a theoretical concept of “regulation.” This
integration allows for an effective means of achieving sustainability. The TPR
strategy—as a systematic one—attempts to foster compliance and lower litigation
costs through tactics of coordination. The studies performed by the HIIL project on
TPR in 2014 state that “the decision concerning which objectives should be
internalized and how conflicts among objectives should be solved is a key dimension
of effectiveness.”22 Thus, the inclusion of conflict-‐resolution tools, management
mechanism, and compliance incentives all within a “single regime” among cross-‐
border, cross-‐sector, and cross-‐governance actors engenders TPR as a forum of
“transparent” dialogue: a space to address trade-‐offs and coordinate intersecting
objectives.
The subject of regulation may also be “institutionalized” by horizontal and vertical
complementarities, leading to even more forms of coordination and governance.
21 Cafaggi, F. and A. Renda, p. 52. 22 idem, p 53.
17
Horizontal complementarities represent those public and private regimes, which co-‐
exist at a transnational level whereas vertical complementarities are constituted
when transnational private regimes are complemented by public regulation at a
domestic level (or vice-‐versa).23 Observing the relationships—spontaneously or
formally generated—between the private and public spheres, the role of the
complementarity reveals the relationship between legitimacy and effectiveness at
stake in TPR. Cafaggi argues how the “institutional complementarity approach”
attests to the effectiveness of private regulation as highly dependent on the
legitimacy of public institutions. However, the join working paper set forth by
Cafaggi and Renda, further complicates the vertical/horizontal complex, suggesting
how the cross-‐referencing between the two often lead to an “umbrella” set of
standards; that is, packaging one set of standards into another.24 The meta-‐
regulation, here, acts as a way to “bridge the gap” between the holes in
public/private regulatory schemes, allowing for specific regulatory goals or standards
to be achieved (e.g. ISEAL Alliance). As a result, the multiplicity of regulatory models
reveals not only the potential proliferation of but also the problematic relationship
among notions of legitimacy, effectiveness, and the interaction between the two.
The “institutional complementarity approach” and the role of “meta-‐regulation” play
a significant role in evaluating the accountability of TPR.
HOW PRIVATE REGULATION OPERATES
In order to assess the accountability—and, thus, effectiveness (and vice-‐versa)—of a
regime, we must first observe how private regulators operate. Cafaggi and Renda
propose five distinguishing characteristics of the private regulatory scheme in order
to assess its overall scope: (i) if agenda setting, (ii) whether the scheme regulates
behavior (such as, codes of conduct) or sets standards on products and services, (iii)
whether the scheme assists or manages the implementation of public regulation, (iv)
embeds enforcement mechanisms (i.e. alternative or complementary to public
schemes), and (v) all of the above).25
23 Cafaggi, p. 16. 24 Cafaggi, F. and A. Renda, p. 57. 25 idem, p. 54.
18
The use of standards codifies private regulation, yet voluntarily. That is, the
standardization of specific technical or contractual quantities and qualities is for the
most part voluntary in private regulatory schemes, granting the “regulated” the
choice to adopt or not adopt such standards. However, the notion of “legally
voluntary” is a slippery one insofar as such standards become mandatory if external
factors (such as market or social standards) oblige the “regulated” entity to adopt
them.26 Further, the incorporation of private standards into legislative or
administrative acts are binding in nature. Considering the “assumption of
voluntariness,” Cafaggi and Renda note that such assumption:
Should be qualified by considering many legal and non-‐legal factors that reduce or eliminate the space of choice for regulated entities. Evaluation of effectiveness changes depending on whether regulated entities can choose or are forced to join on the basis of factual circumstances or legal obligations imposed by public regulation. (55)
Consequently, “voluntary” standards are often, de facto, mandatory, challenging the
space for choice by the regulated entity. While observing the effectiveness of private
regulation, one must consider the relationship among the various regimes within the
scheme, in which such interaction may be facilitated by cooperation or plagued by
competition. The markings of cooperation can be found in either the form of
organization (e.g. association, foundation, for profit, not for profit) or contractual
agreements. As Cafaggi and Renda point out, this set-‐up allows for the regulation of
member behavior, but can also have impacts on third parties” (55).
THE USE AND ABUSE OF THE SUPPLY CHAIN
Thus, the role of third parties vis-‐à-‐vis impact – as impacted or impacting – appears
in the transnational networks of TPR, in which agreements or contracts act as the
predominant tools. In such a way, traditional codes of conduct or guidelines are
coupled with commercial contracts in order to ensure compliance along the supply
chain and across state boundaries, imposing standards on third parties as well.
26 idem, p. 55.
19
Cafaggi describes the use of the supply chain as a “vehicle to implement regulation”
(Cafaggi, 2012b, 2013b). As a result, the consumer may act to define the regulatory
regime itself by incorporating demands at the retailer stage, which are translated
into contractual rules, spill-‐over onto the supply chain, and ultimately mandate
compliance of those very rules along the chain. The role of the “third party,” as
exemplified herein, marks a participatory actor of the regulatory regime by indirectly
enforcing standards onto the regime itself. The consumer demand for human rights
protection, eco-‐labeling, certification, or environmental protection to name a few
(e.g. 10% energy consumption targets for the top 100 clothing factories cited by the
OECD in 2010) speak to the power of third party spill-‐over, in which new instruments
become binding by virtue of legal and non-‐legal enforcement mechanisms. 27
The use of a contract becomes instrumental in the enforcement of international soft
law, acting as a vehicle for the “hardening” of soft law and promoting harmonized
standards at an international level (Cafaggi, 30). This phenomenon can be observed
in the field of food safety, as public soft law becomes binding, or corporate social
responsibility, as the contracting of the supply chain implements international soft
law by virtue of delegating regulation to private bodies via trans-‐national private
regulation regimes, obliging compliance along the supply chain and within the
contractual relationship among suppliers and sub-‐contractors.28 The liability of
compliance with standards set forth by international soft law standards (such as the
10 principles cited in the UN Global Compact) can be enforced by corporate social
responsibility provided for by transnational private regimes, implementing its
“hardened” regulation and imposing the obligation of compliance (e.g. human rights
standards) through enforcement mechanisms.
Another area exemplifying the use of contract law in the supply chain is that of food
safety, in which contractual agreements “harden” international soft law. The
voluntary guidelines set forth by the FAO Codex Alimentarius illustrate the
incorporation of product and process standards within the contractual relationship
27 idem, p.55. 28 Cafaggi, p. 31.
20
(or by reference), allowing the retailer to stipulate employment, product safety, and
environmental clauses in the contract with a given supplier. This shift in the “terms
of exchange”29 integrates concerns, which traditionally were not included in the
contractual layout of the supplier/retailer agreement. Such transformation in the
“terms of exchange” revolutionizes the very function of the contract from a mere
trade apparatus to a regulatory instrument of conduct management, integrating
social, environmental, and equality concerns within the clausal relationship.
However, as evidenced in the area of environmental protection and corporate social
responsibility, potential loopholes in the effectiveness of contractual clauses in TPR
may arise in regards to the enforceability of contractual standards as conditioned by
where the regulatory beneficiary is positioned, rendering the effectiveness of the
contract as dependent on this organizational leverage. Secondly, as Cafaggi argues,
the “right” to monitor does not translate into the “duty” to monitor, exposing the
limitations of contract law. Although a supplier may promise to adhere to worker,
environmental, and consumer obligations, the third party’s incentive to monitor
such adherence is often conditioned by the party with “deeper pockets.”30 In such a
way, the theoretical reach of TPR is challenged by the practical limitations of
enforcement and, consequently, the implications on the effectiveness of the TPR
scheme.
TPR IN SEARCH OF A DEFINITION
Insofar as transnational private regulation does not adhere to a commonly agreed-‐
upon definition nor do its various manifestations – its form, regulatory relationship,
and governance model – conform to a pre-‐determined formula, the assessment of
its effectiveness represents, thus, a challenging feat. The current working paper of
Cafaggi and Renda set forth a “conceptual framework” for identifying the four key
dimensions of TPR in order to, then, assess its overall effectiveness. By establishing a
priori fundamental principles of and for effective regulation, we can then evaluate
the mechanics and operations of TPR in light of this conceptual framework, allowing
29 idem. 30 idem, p. 32.
21
for an informed evaluation of a system’s sustainability. Firstly, the “quality” of
private regulation is often measured according to conventional metrics, such as
“certainty, predictability, lack of ambiguity, efficacy, etc.,”31 taking into consideration
that high quality norms should not interfere with existing or co-‐existing regulatory
regimes. The potential conflict between measurements of “quality,” according to the
criteria of alternative regimes, should be accounted for in the evaluation of
regulatory quality insofar as not all metrics are created (or should be created)
equally, but may vary according to the indicators determined.
A second dimension of private regulation can be characterized by the notion of
“legitimacy,” in regards to the voluntary or non-‐voluntary nature of standardization,
the level of responsiveness, the degree of inclusiveness, and the representative
quality of the scheme in relation to the regulated bodies.32 The resulting class of
legitimacy – voluntary or mandatory – rests in a matter of consent in the former case
and formal delegation in the latter. Hence, “voluntary” legitimacy brings with it a
higher expectation of compliance and effectiveness in market-‐based communities
via private regulatory organizations.
Thirdly, the role of “enforcement” in the form of mechanisms of a legal or non-‐legal
nature: arbitration, mediation, administrative execution, domestic and international
implementation, the use of reputation, “blame and shame” methods, etc.33 Due to
the transposition of enforcement mechanism between public and private regulatory
bodies, the “regulatory space” of private norms must be analyzed within these
overlapping spheres, exerting enforcement through both judicial and non-‐judicial
instruments. The increasing use of non-‐judicial enforcement strategies through
transnational networks can often be found in the area of food safety or non-‐
discrimination standardization for example, in which the complementarity between
administrative public agency and private enforcement mechanisms act by
cooperation, promoting compliance and penalize violation.
31 Cafaggi, F. and A. Renda, p. 60. 32 idem. 33 idem.
22
Lastly, and perhaps most significantly, the dimension of regulatory “effectiveness”
defines the effectiveness of TPR in general. Cafaggi and Renda define effectiveness
as “the extent to which private regulation achieves its objectives and the correlation
between means and ends” (61). In such a way, the effectiveness of transnational
private regulation lies in sum of its individual parts; that is, standard setting,
monitoring, and enforcement. Effectiveness considers both ex ante the relation
between “means” and “ends,” and ex post the degree in which its objectives
were/were not achieved, as well as the positive or negative externalities of a norm
on the actors involved (i.e. the regulates, regulated, beneficiaries).34 Depending,
then, on the regulatory relationship and governance model chosen, the degree of
effectiveness varies. Such calls upon the “interpreters” of effectiveness to inquire
upon the combination of these variables – the type of norm and institutional
framework – as a means to understanding effectiveness, as both an organizationally
and structurally informed metric stick. The very interdependency among all of the
dimensions mentioned above mirrors the “integrated approach” deployed by the
OECD, which attributes an interrelated dimensionality of quality, legitimacy,
enforcement and effectiveness. In other words, the quality of a given scheme
increases its legitimacy, which in turn promotes compliance, that then enables
enforcement, which in turn increases effectiveness, ultimately re-‐confirming its
quality. The causal relationship among the characteristics of TPR allows for a so-‐
called “disentangling” of their interlocked qualities.
HOW GOVERNANCE CAN EFFECT EFFECTIVENSS?
As discussed, the role of the governmental model chosen -‐ as defined by its legal
form and related design -‐ plays an important role in determining the effectiveness of
private regulatory organizations. That said, we have witnessed a causal relationship
between governance and activity, calling into question what are the effects of
governance and, successively, how governance can effect effectiveness? The major
34 idem, p. 61.
23
elements of governance that demonstrate an impact on effectiveness can be related
to eight proxies.
Firstly, the instance of functional separation represents a criterion for good
governance insofar as separation promises the existence of checks and balances
between the main standard-‐setting body (such as the board) and the main decision-‐
making body.35 Without “regulatory space” between standard-‐setting, monitoring of
compliance, and enforcement, the organization runs the risk of covert interests
underlying these processes, undermining the effectiveness of regulation. In such a
way, the lack of separation becomes problematic, especially in the enforcement
stages of compliance, in which a hypothetical organization that does not exhibit
functional separation may be less willing to enforce sanctions or monitoring
compliance on itself given that responsibility becomes an internal “value” rather
than an external indicator to be monitored. Hence, good governance implies
functional separation followed by the coordination of functions separated across the
regulatory process.
The logical progression from functional separation, then, is to cover that of a multi-‐
level structure of the regulatory chain. As the regulatory scheme is made up of multi-‐
boded and multi-‐faceted parts, the role of “separation,” as a functional distinction
(descriptive in nature) and operational one (normative), the degree of separation
both horizontally and vertically impacts the effectiveness of the regulatory scheme.
The trans-‐national nature of TPR implies the structuring of global, regional, and
national levels of standard-‐setting, monitoring and enforcing agents, whereas
functional separation distinguishes among the bodies responsible for these
regulatory processes. Thus, the “degree” and “quality” of coordination among and
across these layers of normative and descriptive separation represent the
dependent variables upon which regulatory performance relies. The blurred
boundaries between governance spheres increase the uncertainty of responsibility
allocation and heighten the degree of structural interdependencies, underlining the
35 Cafaggi, p. 64.
24
necessity to reformulate the unit of analysis. Considering the multi-‐level structure
and complexity of the regulatory chain, the unit of analysis must be reduced to the
individual organization,36 allowing for the measurement and assessment of
regulatory effectiveness in conjunction with the coordination mechanisms employed
across functions and between spheres.
Thirdly, the quality of membership in the organization impacts the assessment of its
effectiveness, depending upon the degree of representation allotted to its members;
that is, the (in)accessibility of a “voice.” The distribution of representative power
among stakeholders directly correlates to the legitimacy of the member-‐based
organization, enabling a system of “checks and balances” by virtue of proportionate
allocation of representation and, theoretically, more “fair” decision-‐making. The
effectiveness of a member-‐based organization relies upon the legal and non-‐legal
tools available to it—the ability to “check” the activity of the board. The normative
variable of inclusiveness, in which all stakeholders relevant to the decision-‐making
and/or standard-‐making process should be present in order to ensure the adoption
of common norms or agreements. The consultation of stakeholders in ex post
evaluation of an organization’s performance plays a key part in assessing
effectiveness.
Another normative variable includes the organization’s choice to regulate for profit
or not-‐for-‐profit, which again influences not only effectiveness but how it should be
evaluated. As Cafaggi and Renda argue, “it affects definition of both means and
objectives” (66). The former may tend to maximize benefits in favor of its members
over the public, whereas the latter tends to maximize public welfare over the
interests of member-‐based interests. This interplay reveals the relationship between
public interest and the relationship between the regulated/beneficiaries.37
Similarly, the source of funding affects the accountability of a private organization.
That is, an organization with multiple financial inputs may be more “free” to pursue
36 Cafaggi, F. and A. Renda, p. 66. 37 idem, p. 66.
25
common and public good objectives, whereas an organization dependent on a single
source of financial support may be notably conditioned by the interests defined by
its funder. In such a way, the heterogeneity of funding represents a crucial factor for
effective governance.
The transparency of the regulatory process – from agenda-‐setting to enforcement –
marks a crucial factor in qualifying effectiveness. That is, the disclosure of financial
and non-‐financial reporting, the communication of decisions, the consultation of
stakeholders, comprehensive ex ante impact assessments, and ex post evaluations
all contribute to the transparency of an organization’s activity, exposing its quality.
The very quality of the regulatory process, thus, is directly related to the
effectiveness of its governance.
Lastly, the notion of accountability emerges in the realm of private regulation,
specifically in reference to those bodies in pursuit of common or public goods.
Adopting the definition of accountability set forth by Cafaggi and Renda, in which an
organization “adequately motivates and explains its regulatory decisions, monitors
results and impacts, and bears responsibility for the failure to achieve its own goals
and objectives,” the level of accountability is proportionally related to the
effectiveness of regulation.
WHAT TO MAKE OF EFFECTIVNESS
The variables of governance that impact the quality of governance – and by
consequence, effectiveness – must, then, be evaluated by virtue of the tools of and
means for measuring that effectiveness. However, the choice of assessment tools
becomes equally as important as the level of effectiveness itself if we understand
the assessment as a means towards higher efficiency and further improvement of
regulation. That said, the importance of stakeholder consultations, ex ante
assessments, and monitoring and ex post evaluation represents not only the
measurement of effectiveness but the very vehicle for achieving it.
26
As previously discussed, the role of stakeholders is evidenced in various steps
throughout the regulatory process, including the early consultations to locate policy
issues, discussion of policy options, and their experience-‐based input as to the
expected effectiveness and potential problematics of a given regulation (i.e.
compliance). However, the legitimacy of stakeholder questions lies in the legitimacy
of the questions asked; that is, only relative questions can expose the relevant policy
issues at stake. In such a way, the consultation exercise can facilitate regulators in
identifying policy issues both ex ante and ex post.
The evaluation of effectiveness occurs most pertinently in the early stages of the
regulatory process; that is, through ex ante impact assessments, which comprises a
qualitative and quantitative based framework for assessing the sustainability of a
given policy proposal whereas ex post evaluation and monitoring allows for policy-‐
makers to consider the quality of legal rules and the accountability of the
regulators.38 This method is best exemplified in the EU’s “Smart Regulation
Framework”39 of 2014, providing for structured guidelines to assess the viability of
policy options by identifying the policy problem, providing the reasoning for
intervention, performing a comparative analysis of policy options (as well as the
“zero change” option), indicating the preferred policy options, and outlining the
criterion used for evaluating its implementation, enforcement, and effectiveness ex
post in order to compare the policy objectives with empirical impacts. The “Smart
Regulation” guidelines suggest, theoretically and linguistically, for the “bettering” of
regulation – that is, the qualitative improvement over time – by virtue of the
evaluation of its effectiveness and the implications on regulatory sustainability.
Hence, the EU’s interpretation of impact assessment reveals a comprehensive
approach that aims not only at the policy-‐making process, but “good” policy-‐making,
underlining the importance of improvement and efficiency through the learning
curve created between ex ante and ex post assessments.
38 idem, p. 69. 39 European Commission (2014), “Regulatory Fitness and Performance Programme (REFIT): State of Play and Outlook,” COM (2014) 368 Final, European Commission Communication, 18 June.
27
This integrated approach towards assessing the effectiveness of the regulatory
process is often referred to as the “policy cycle” approach (Cafaggi and Renda,
2014), calling upon the need of a “complete and sound evaluation toolkit” made up
of ex ante assessment of legislation and legislation measures, interim monitoring
and evaluation of enforcement and compliance, and a pre-‐determined ex post
evaluation for assessing the performance and sustainability of regulation through
cost-‐benefit analysis. Thus, the quality of the “policy cycle” approach marks its merit
as a tool for smart policy-‐making and effective regulation. If, however, the
comprehensive of analysis is compromised – in the instance of the partial
assessment of impacts or if the “right” questions of evaluation are not posed – the
quality of analysis is consequentially unaccountable. In other words, if all the
impacts of the regulatory process are not taken into account, the assessment of
regulatory quality and performance is rendered futile—the impact assessment, de
facto, dismantles itself from within.
THE (MIS)USE OF INDICATORS
The functionality of regulatory impact assessment lies in the choice of appropriate
indicators chosen in relation to the regulatory question under evaluation. Failing to
choose the “right” indicators translates, ultimately, to the failure to assess regulatory
effectiveness. Indicators may be used by a variety of actors and within a variety of
regulatory schemes: (i) for self-‐evaluation in private schemes, (ii) for independent or
contractual evaluation by third parties, or (iii) for the public sector.40 Whereas ex
ante indicators serve to identify the preferred policy choice among alternative
regulatory options, ex post indicators allow for the evaluation of the governance-‐
models chosen, regulatory performance, and to re-‐align the relationship between
instruments and policy objectives. Again, this “policy-‐approach” assessment allows
for complementarity interaction between ex ante indicators (such as governance)
and ex post indicators (such as impact) in order to promote “better” regulation by
virtue of better instruments. The “policy cycle’ approach, which is strongly endorsed
40 Cafaggi, F. and A. Renda, p. 72.
28
by the OECD, relies upon ex ante and ex post indicators as marking the “two most
important appraisal instruments.”41
Furthermore, the European Commission’s endorsement of “fitness checks” in its
“REFIT” communication of 2014 stresses the use of impact indicators, which extends
beyond mere output indicators, in order to assess the impact of regulatory regimes
on widened spheres of policy concern, social welfare for example. In such a way, the
use of indicators – and, most importantly, what they indicate – expands the horizons
of public and private regulatory “checks and balances” into the realms of
transnational private regulatory schemes (such as the ISEAL Alliance), leading to a
hybridization of multiple indicators. Depending on the objective of the regulatory
body, effectives may be measured through “performance” indicators in co-‐
regulatory schemes (public and private regulation), or simply “output” indicators in
technical standardization fields (third party regulation), or through a comprehensive
set of indicators encompassed by “impact” indicators in the trans-‐national arena.
Bearing in mind that output – the material result of a given regulation – is not
synonymous with its impact – the effect, impact indicators transcend the boundaries
of conventional areas of assessment, enabling the scientific-‐based appraisal of
matters such as social welfare, gender equality, and the quality of life. In such a way,
the “policy cycle” approach endorsed by international organizations and the “Smart
Regulation” agenda set forth by the EU demonstrate comprehensive approaches
towards quantifying and qualifying regulatory impact in a revolutionary way,
provoking us to ask what lies at stake in regulation and how is the regulatory process
a socially, economically, and politically informed one.
RE-‐PIECING THE PUZZLE: COMPLIANCE, EFFECTIVNESS, AND THE LEARNING CURVE
Monitoring compliance in relation to the standards, rules, and impacts of the
regulatory scheme marks a fundamental role in the overall evaluation of
effectiveness: the ability to monitor the regulated entity’s compliance (or
incompliance) with rules and the assessment of that entity’s ability to achieve (or not
41 idem, p. 73.
29
achieve) its objectives. Measuring compliance, hence, is representative of a core
concern for regulatory effectiveness (i.e. if the regulated entity meets the
requirements of provisions). Insofar as the notion of compliance concerns both a)
the regulated entities’ compliance with the rules determined by the scheme and b)
the performance and overall effectiveness of the scheme itself42, the former – a –
does not however directly imply the occurrence of the latter – b. Although a typically
leads to b in the above hypothesis, correlation does not necessarily confirm
causation – a post hoc, ergo propter hoc fallacy – in our methodological evaluation of
compliance and effectiveness.
Instead, the actual effectiveness of a regulatory scheme – and its performance
therein – marks the center of our analysis, as an inclusive consequence of the
various “inputs.” Measuring effectiveness in its correlative relation to compliance
allows us to characterize the “collective dimension of compliance,” in which
cooperation between the regulators and regulated enables monitoring in private
regulatory schemes. As Cafaggi and Renda argue, “whether or not regulatory
objectives are met depends on the performance of the regulated as a collective.”
Therefore, the metric of evaluation must account for “interdependencies among
regulated entities’ conducts,” (75) which represents a fundamental concern in the
area of food safety or environmental protection. That is, partial compliance in food
safety or traceability schemes undermines their entire risk management system,
rendering control mechanisms partial and inconclusive and, therefore, ineffective.
Such underlines the need for interdependent cooperation among regulated entities
in order to approach a full account of not only compliance but overall impact. In light
of such argument, Cafaggi and Renda astutely stress:
The focus of effectiveness measurement should thus not be limited to compliance with rules, but rather focus on the achievement of regulatory objectives and the causes for the failures whenever they occur. (75) Such argument draws our attention to the objectives of assessing effectiveness
itself; that is, the analysis of a given scheme’s effectiveness has no value in and of
itself if we do not apply the lessons learned from the weaknesses identified in
42 Cafaggi, F. and A. Renda, p. 74.
30
existing regulatory schemes towards future methods of improvement—the so-‐called
“learning curve.” Fragmenting, here, compliance from effectiveness as no longer
mutually dependent factors, the authors argue that high compliance can be
considered “necessary,” but not always the sufficient condition for the achievement
of regulatory goals. Insofar as compliance represents only one portion for
consideration, the ex post learning process represents an equally important (if not
more) part of the puzzle, as effectiveness measures tangible results. In other words,
the goal – for private regulators – should be that of making private regulation work.
Lastly, the method and quality of enforcement represents a key prerequisite to the
effectiveness of private regulatory organizations. Without enforcement mechanisms
in place, the degree of compliance obviously tends to decrease. The threat of
sanctions or violation mechanisms can only be effective if legitimate enforcement
takes place. That said, the main characteristics for assessing the method and quality
of enforcement are as follows: (i) the scope and quality of enforcement, (ii) the
independence of enforcement (hence, its legitimacy), (iii) the length of procedures,
(iv) the relationship between enforcement by private bodies and public enforcement
bodies (e.g. courts).43 In such a way, the quality of the remedies available to the
regulator determines the quality of enforcement. On the other hand, the
incorporation of non-‐member representation (in addition to member
representativeness) increases the credibility of the enforcement scheme from a
point of view “outside” that of the regulator; the incorporation of due process
requirements, the right to fair trail, impartiality of enforces, access rights, the right
to appeal, and the recognition of third-‐parties rights and the potential impacts on
third-‐parties, transfers a degree of legitimacy to the enforcement activity, ultimately
increasing its effectiveness. Lastly, the use of “reputational effects” – in “public” or
“confidential” settlement of conflict – the effect of staking reputation in public
disputes typically tends to increase the effectiveness and standing of private
enforcement mechanisms from a public perspective.
43 idem, p. 76.
31
THE FIVE Ws OF TPR
As discussed throughout the first part of the paper, transnational private regulatory
schemes can be assessed in terms of quality, legitimacy, enforcement, and
effectiveness. The ultimate role and capacity of a scheme’s effectiveness distills the
other 3 factors into the most determinant factor of consideration: that of
effectiveness, in which the perceived legitimacy of the scheme, quality of rules, and
credibility of enforcement all serve to make a scheme desirable (or undesirable,
depending on one’s objectives) to potential members if indicative of benefits in line
with the given agenda. That said, depending on the policy perspective from which
one stands – public, private, individual, or collective – the perceived effectiveness of
regulatory organizations conditions the appeal of it: the who doing the evaluation,
the what is being evaluated, the how evaluation is organized, the when or where in
the regulatory process does it take place, and the why – in other words, for whom –
all condition and are conditioned by the regulatory scheme chosen.
As argued by Cafaggi and Renda, “what really matters is not whether the evaluation
is performed directly or indirectly by the evaluated entity, but who defines the
purposes and the objectives of the evaluation, and what is made of the results” (77).
Consequently, we can distinguish among transnational private regulators in terms of
who does the evaluating, whether (i) by independent private organizations (usually
for profit or non profit based, or independent organizations in contractual
agreements with the evaluated entity such as the case of certification), (ii) by meta-‐
private regulators (such as ISEAL or NGOs in charge of monitoring on behalf of other
constituencies), or (iii) by public entities.44
The subject or the what should be evaluated, as previously discussed, should inquire
most importantly into the link between the form of governance and the regulatory
process, the inclusiveness of stakeholders, the consideration of not only regulatory
objectives but regulatory impacts as well (and their distributive consequences), and
the selection of indicators and the allocation of their so-‐called “weight” in the
44 idem, p. 78.
32
evaluation process. Although many typologies of TPR exist, we shall narrow our
focus here on to the function of meta-‐regulators, inquiring into the evaluation of
their performance as meta-‐regulators. That is, how should meta-‐regulators evaluate
their effectiveness? Insofar as the objective of meta-‐regulation is the capacity to
change the behavior of the regulators moreover than the regulated, the
effectiveness of the meta-‐regulator’s performance lies in this crux. The ability to
transform the regulator’s relationship in regards to governance and within the
regulatory process evidences, then, the effectiveness of the meta-‐regulator, namely
the capacity to improve regulatory performance over the degree of compliance of
the regulated entity (e.g. ISEAL).45 Meta-‐regulation can be broken down into 2
categories: those with members and those who produce standards and codes to be
used freely by others. Thus, the organizational nature of the meta-‐regulator
influences its effectiveness; that is, the credibility of a member-‐based meta-‐
regulator depends upon its regulatory “independence” from its members (i.e. the
potential influence of stakeholders in agenda-‐setting and the degree of compliance
enforced). In the latter form, such as the GRI scheme, effectiveness can be reflected
by the degree in which those organizations that freely adopt rules and standards
ensure monitoring the compliance with those rules standards. The form of meta-‐
regulation is generally materialized into a set of guidelines to be adopted.
The notion of “benchmarking” comes into play in discussing the where and when
regulation occurs along the regulatory process, specifically in regards to the attempt
to harmonize multiple regulatory schemes along a homogenous axis of comparison.
In such a way, the relationship between the actors in the regulatory process (i.e. if
contractually bound or voluntarily cooperative), affects the institutional set-‐up of the
regulatory scheme. Although benchmarking allows for a direct comparison of
multiple regulatory schemes (such as those carried out in food certification schemes
like that of GlobalGap), the attempt to harmonize according to formal requirements
may, in turn, reduce its ability to actually evaluate “on the basis of real functioning”
(84). In such a way, comparative evaluation becomes problematic if the number of
45 idem, p. 81.
33
standards and implementation practices are too diverse, replacing harmonization via
standard-‐setting with defining common criteria and indicators that create systemic
rather than single organization effectiveness.46 On the contrary, we have observed
an evolution in the attempt to harmonize differences in the area of sustainability,
shifting between the harmonization approach to one more based on coordination,
across sectors, by virtue of drafting indicators to define regulatory performance and
to assess sustainability.47 One of the leading models of these coordination efforts in
meta-‐regulation is illustrated by the SAFA tool set forth by the Food and Agricultural
Organization (FAO) in collaboration with private organizations; its effort located in
the definition of a global Sustainability Assessment of Food and Agricultural Systems
(SAFA), namely through the drafting of comprehensive guidelines.
HOLES IN THE SYSTEM
In their conclusion on the analysis of a selection of private regulatory schemes,
Cafaggi and Renda underline three major issues associated with the appraisal of
their effectiveness. Firstly, ex ante impact analysis “is not very common” and “does
not follow a rigorous methodology,” (99) which undermines the accountability of
governance in the choice of a given standard and, in turn, the ability for internal
members or external stakeholders to quantitatively and qualitatively assess the
performance of the regulatory scheme. Secondly, the authors claim that despite the
prevalence of ex post evaluation, the assessment concentrates on member
compliance, failing to select indicators to measure the achievement of regulatory
objectives and, furthermore, impacts external to the regulatory scheme itself. On a
side note, according to their findings, those who define the indicators are not usually
those who perform the assessment. Lastly, as we have emphasized throughout our
discussion, the exclusion of the regulatory process and the role of governance from
the private regulatory scheme analysis disables the ability to evaluate overall
effectiveness.48 Such criticisms, then, serve as a bridge-‐way between our analysis of
46 idem, p. 86. 47 Although Cafaggi and Renda refer to the strength of SAFA as engaging in the definition of criteria “to define regulatory performance,” (86) I have expanded the tool’s merit to include that of assessing sustainability as well. 48 idem, p. 99.
34
the effectiveness of transnational private regulation – and corporate social
responsibility reporting – to alternative frameworks developed by virtue of private
and public cooperation in the field of sustainability, in which the SAFA tool
represents one of the leading frameworks for assessing sustainability in the food and
agricultural sector. In contrast to the above criticisms plaguing the majority of
transnational private regulatory schemes, SAFA instead represents a comprehensive
tool, subverting TPR’s “holes in the system” into areas of thorough assessment.
Hence, the guidelines comprise an extensive use of ex ante impact assessment and
ex post evaluation, integrating and building upon multiple sustainability frameworks
ranging from corporate social responsibility (CSR) to social and environmental
standards and reporting. In such a way, the FAO performed a sort of “mapping” of
sustainability indicators for the agri-‐food sector, designing a structured framework of
performance-‐based indicators in cooperation with another UN subsidiary body, the
International Trade Centre via the creation of “Standards Maps.”49 The SAFA tool
structurally avoids such methodological flaws, drafting guidelines for the use of
extensive ex ante and ex post impact assessment, the possibility for the regulated
entity to contextualize the default indicators in light of specific circumstances, and
the allocation of “value” in performance-‐based indicators over the mere practice and
target-‐based appraisals. Lastly, the stipulation of governance as one of the 4
overarching dimensions of sustainability in the SAFA guidelines and the underlying
emphasis on external impacts retort the “holes” found in most TPR schemes,
suggesting SAFA to be one of the leading tools in the field of sustainability
assessment.
49 Available at http://www.intracen.org
35
CHAPTER 2: SUSTAINABILITY IN SEARCH OF A DEFINITION
As described in the preface of the Sustainability Assessment of Food and Agricultural
Systems Guidelines (SAFA) of 2013, the principle of sustainable development had
first emerged as a universal concept at the 1992 Earth Summit, after which its
significance continuously mutated in space and time, assuming varied and multiple
meanings. Over the course of nearly 25 years, 200 different types of sustainability
frameworks have developed in thee public and private spheres, national and
international institutions, and domestic and global domains, manifesting in codes of
conduct, CSR obligations, social and environmental standards, and food safety
guidelines. Further, 120 voluntary sustainability standards, including eco-‐labeling,
origins certification schemes, GMO standards, as well as organic certified practices
have emerged, merging and diverging in scope. Despite this notable shift towards
and desire for “sustainability,” a single and comprehensive framework for
sustainability reporting does not yet exist today. Proliferating beyond domestic,
international, and transnational spheres of regulation and between supplier, retailer,
and consumer regulatory relations across the supply-‐chain, the notion of
“sustainability” becomes even more evanescent, slipping between obfuscating
spheres of meaning(lessness). Considering the intricacy (or, perhaps, impossibility) of
compounding multiple frameworks into a single, common language, lens, and set of
rules, the FAO seeks to simultaneously simplify sustainability assessment while
defining a new, ever more encompassing framework for it, establishing a
comprehensive, step-‐by-‐step tool for assessing 4 crucial dimensions of sustainability
– good governance, environmental integrity, economic resilience, and social well-‐
being – along food and agricultural value chains.50 This paper shall argue for the
merit of the SAFA tool in the field of sustainability assessment and reporting, as well
as the implications of it for the future. With that in mind, one must continually call
into question what lies at stake in measuring that which we cannot define; as the
definition of “sustainability” forever escapes the definer, a similar dynamic seems to
50 FAO. 2013. Sustainability Assessment Food and Agricultural Systems Guidelines. Food and Agriculture Organization of the United Nations, Rome (available at http://www.fao.org/fileadmin/templates/nr/sustainability_pathways/docs/SAFA_Guidelines_Final_122013.pdf
36
be taking place in the measurement of it. That is, despite the number of
mechanisms, tools, and metrics for sustainable development perpetually increases
over time, sustainability per se continuously diminishes.
RECONSIDERING SUSTAINABILITY VIS-‐À-‐VIS SAFA
In order to integrate and appropriate the multiple existing frameworks for
sustainability schemes, SAFA offers a “holistic” approach, considering the complex
dimensionality of sustainability by virtue of the incorporation of existing
standardization systems, building upon rather than substituting them. In such a way,
SAFA “reconsiders” existing sustainability schemes and expands upon them,
introducing an umbrella-‐like framework, allowing for a common language for
sustainability measurement. The very making of the tool – as a process of integration
– by the FAO reveals just this, in which the writers “mapped” all sustainability
indicators in the food sector and drafted its goal and scope in 2010, which was
followed by stakeholder surveys targeting field experts from the food and
agricultural industry, public administrations, NGOs, multi-‐stakeholders and
multilateral institutions in mid-‐2011, a cross-‐comparison of standards and indictors
of 44 different systems in late 2011 (i.e. 18 industry standards, 5 farm level systems,
4 systems of multilateral institutions, 7 NGO systems, 5 roundtable standards, and 5
other systems), and the circulation of draft guidelines in January 2012; after a “pilot
phase” of the Test Version 1.0 in late 2012, the SAFA Guidelines Version 3.0 were
eventually approved in 2013.51 Thus, the structure of the current SAFA Guidelines
are, mostly, based upon the existing ISO 140140:2006, the ISEAL Code of Good
Practice and the Sustainability Reporting Guidelines and the Food Sector Supplement
of the GRI.52 By building upon existing schemes rather than replacing them, SAFA
simultaneously expands upon reliable expertise and allows enterprises to utilize
existing data sources, decreasing costs, avoiding duplicity, and increasing efficiency,
attesting to its comprehensiveness and accountability as a tool.
51 FAO. 2012. Sustainability Assessment Food and Agricultural Systems Guidelines (Test Version 1.0). Food and Agriculture Organization of the United Nations, Rome (available at http://www.fao.org/fileadmin/templates/nr/sustainability_pathways/docs/SAFA_Guidelines_12_June_2012_final_v2.pdf 52 Cafaggi, F. and A. Renda, p. 104.
37
The “guiding vision” of SAFA is “that food and agricultural systems worldwide are
characterized by all 4 dimensions of sustainability: good governance, environmental
integrity, economic resilience, and social well-‐being,”53 in which we can observe a
sort of “re-‐consideration” of what it means to be sustainable. The previous dilemma
of TPR and sustainability “in search of a definition,” returns in our account of SAFA,
in which the FAO’s efforts evidence a holistic approach to measuring not only what is
sustainability, but why it is important. Hence, the SAFA “vision” re-‐thinks
conventional notions of “value,” allocating value in terms of performance here,
striving not only for a mere appraisal of a system, enterprise, or even country’s
sustainability, but instead questioning the implications of its current performance for
the future; sustainability reporting becomes the instrument, thus, of improvement, if
implemented within an transparent and aggregated framework, such as that
provided for by the framework in discussion. By breaking down the assessment into
constituent parts, strengths and weaknesses can be identified and, consequently,
improvements can be drafted—the framework itself determines its functionality
(and potentiality).
SUSTAINABLE DEVELOPMENT: WHAT LIES AT STAKE?
Before introducing the assessment framework and how it works, the SAFA
Guidelines first contextualize its vision, approach, the actors involved, and scope. In
light of the 2015 Millennium Development Goals, and in consideration of the goal to
halve the number of chronically undernourished people worldwide vis-‐à-‐vis
sustainable development, such challenge lies in the effective incorporation of the
environmental, economic, and social dimensions of development. As the amount of
the population still suffering from hunger today represents 16% of the global
community,54 1.4 billion people live in extreme poverty (2005), and food security is
“no reality” for 875 million individuals55, and the increasing human demand strips
the ecosystem of its already scarce availability of resources, we are facing a global
53 SAFA (2012), p. 2. 54 SAFA (2013), p. 12. 55 idem, p. 178.
38
crisis of extreme and irreversible magnitude, calling upon the sustainable
management of agricultural land, forestry, water, atmosphere, and biodiversity.
According to the Stockholm Resilience Centre’s report of 2009, humanity has
transgressed the 3 “environmental planetary boundaries within which we can
operate safely,” underscoring the urgent issues we face today as a global
community: climate change, biodiversity loss, and the impact on the global nitrogen
cycle.56 That said, the sustainable management of the earth’s resources marks a
crucial factor in the preservation of it. The purpose of SAFA, thus, seeks to establish
a framework that can systematically address these issues across sectors, nations,
and markets, echoing its vision for “sustainable development:”
The management and conservation of natural resource base, and the orientation of technological and institutional change in such a manner as to ensure the attainment and continued satisfaction of human needs for present and future generations. Such sustainable development (in the agriculture, forestry, and fisheries sectors) conserved land, water, plant and genetic resource, is environmentally non-‐degrading, technologically appropriate, economically viable and socially acceptable. (FAO Council, 1989).
By situating itself along the supply chain, SAFA aims for the long-‐term
transformation of sustainable food systems, which would require the management
of interconnected sustainability issues beyond the supply chain, extending its
analysis onto the environmental impact of a given product through its lifecycle—the
Life Cycle Approach (LCA). Although SAFA incorporates LCA (a product or process-‐
specific assessment) within its analytical framework, SAFA’s approach lies in the
assessment of the enterprises rather than the product. That is, SAFA assesses the
sustainability of an enterprise in terms of economic, environmental, social, and
governance dimensions, allowing the framework to be implemented on a variety of
levels and along different axis of comparison: at a regional, national, or international
level; in relation to the supply chain; or even in regards to an operational unit.57
56 idem, p. 12. 57 idem, p. 7.
39
WHO USES SAFA?
Therefore, SAFA’s multi-‐level application and multi-‐faceted purposes enables a
variety of actors to use its harmonized framework to ensure consistency,
transparency, and applicability. As a voluntary, self-‐evaluation tool for sustainability
assessment, SAFA may be adopted by agri-‐food enterprises for self-‐assessment (to
identify “hot-‐spots” or areas for performance improvement), NGOs and
sustainability standards systems (to monitor impacts, share data, best practices, and
thresholds) and governments, investors, and policy-‐makers (to establish
sustainability goals, implement local planning or procurement, promote investment
or legislation, provide sustainability reporting for informed policy-‐making).58 By
creating a network of sustainability frameworks on which the SAFA Guidelines are
based, actors may adopt the tool for a varied purposes, including i) sustainability
reporting (e.g. CSR), (ii) benchmarking (e.g. standards and codes), (iii) standards
based on production or processing standards (i.e. added-‐value systems vis-‐à-‐vis
transparent and sustainable processes), and (iv) assessment methodologies of the
production unit or supply chain (e.g. LCA).59
As an umbrella-‐like framework, SAFA serves the function of harmonizing a variety of
sustainability tools in order to promote coordination and convergence into a
common framework based on a common language. The guidelines indicate the
methodological and implementation principles (see Figure 2), in which its core
principles (i.e. relevance, simplicity, cost efficiency, goal orientation, performance
orientation, and transparency) echo those cited in the EU protocol for Better
Regulation set forth by the Mandelkern report60 (i.e. necessity, proportionality,
subsidiarity, transparency, accountability, accessibility and simplicity), aligning the
SAFA guidelines with “real” impact assessment guidelines.
58 idem, p. 5. 59 Idem, p. 8. 60 European Commission (2001), “Mandelkern Group on Better Regulation,” European Commission Communication, 13 November.
40
As evidenced, the coinciding and conflicting nature of sustainability assessments
disables the ability to “internationalize” a single metric of sustainability, blurring our
ability to define it. Recalling our discussion of transnational private regulation, in
which the regulatory process transgressed domestic/international and public/private
boundaries in the assessment of an entity’s “sustainability,” the ability to define
what makes a company “sustainable” under the meta-‐regulation perspective, from
which SAFA operates, faces a similar predicament. In order to “fill the gap,” then, in
such absence of the definition of what qualifies sustainability per se, the SAFA
guidelines “methodologize” sustainability performance assessment (by reference to
a set of themes, sub-‐themes and default indicators) in order to attain an overall
report of an entity’s “sustainability.” In other words, SAFA “fills the gap,” by defining
how to measure sustainability, without actually defining it. Reversing conventional
rules of linguistic theory here, SAFA qualifies the means to its significance rather
than qualifying the sign itself.
DEFINING BOUNDARIES: THE SCOPE OF SAFA
Before introducing the procedures and protocol of SAFA, the boundaries of its
framework must first be established. Specifically, SAFA undertakes a “supply chain
scope,” which allows assessment to occur in relation to all entities in the supply
chain, ranging from the input suppliers (e.g. the feed provider), to the site of primary
production (e.g. fisheries), the wholesaler, transporter, and the retailer. However,
the supply chain, under SAFA “jurisprudence,” does not include the
consumer/consumption insofar as the tool cannot be applied to product-‐specific
sustainability. Instead, SAFA applies to all entities in the global supply chain.
According to the guidelines, an entity conducting self-‐evaluation must determine its
“realm of influence” by firstly defining its scope and the boundaries of the
assessment.61
The temporal scope usually represents the activity of a given entity for a period of 1
year in order to serve as the basis to which future assessments can be compared.
61 SAFA (2013), p. 17.
41
However, depending on the specificity of a the subject of assessment (e.g. long-‐term
impacts on an ecosystem) may require a five year temporal scope, which should be
determined according to expert or scientific-‐based knowledge.
The thematic scope of the sustainability context is a condition of its coherent, multi-‐
level structure. The SAFA framework is made up of 4 overarching dimensions of
sustainability (good governance, environmental integrity, economic resilience, and
social well-‐being), which are then broken down into 21 “high-‐level themes” (such as
participation, materials and energy, vulnerability, and labour rights). These high level
themes represent core sustainability issues, which are further distilled into 58 sub-‐
themes related to the food and agricultural sector. Each sub-‐theme is coupled with
sustainability objectives, detailing an expected sustainability performance for that
specific sub-‐theme, allowing supply chain actors in the course of the assessment to
identify “hot-‐spot” areas – risks or gaps in the existing sustainability efforts – to
solicit recommendations for improvement. The SAFA tool includes “default
indicators” for each of the 58 sub-‐themes in order to exemplify what kinds of
questions should be asked in relation to the given issue in question, providing
performance-‐based indicators (with associated criteria and objectives) as a means of
example. However, in the case of small-‐scale enterprises or on indigenous farming
practices, the use of target-‐based or practice-‐based indicators for assessment may
be more appropriate given the limited capacity and resources of evaluating body.
Although SAFA urges for the use of performance-‐based indicators, its flexibility can
be evidenced here, in its use of default indicators, appropriation of alternative
indicators where fit, and varied weighing mechanisms to uphold SAFA’s strict
protocol even when performance may be difficult to measure.
43
SAFA STEP-‐BY-‐STEP
The first section of the SAFA Guidelines describes the overall vision, background, and
purpose of the tool, whereas the second section provides for the actual
implementation of it. The step-‐by-‐step approach stipulates 4 main procedures to be
followed (mapping, contextualization, indicators, and reporting), leading to the final
Performance Report: a descriptive and critical sustainability report of the assessed
entity.
Figure 3: SAFA steps and phases
STEP #1: MAPPING MATERIAL MATTERS
The first step – mapping – comprises the definition of goals and purpose, a
description of the assessed entity (size, position, and market power), the definition
of the scope and boundaries of the assessment (as well as the entity’s sphere of
influence and impact), the description of impact allocation criteria, and a visual
representation of the value chain, the interconnected relationships, and
boundaries.62 In order to complete the above, one must “map” the goals and scope
62 SAFA (2013), p. 25.
44
of the assessment and assessed entity. According to the guidelines, the setting of
goals should “unambiguously state the reasons for doing the assessment, the
intended audience and the intended use of the results,” (25) which builds off the
provisions set forth by ISO in 2009. In other words, one must answer the “5 W’s” as
discussed earlier.
The setting of the scope and boundaries, on the other hand, presents a difficult task
to the evaluator insofar as the notion of a borders are perpetually transgressed –
both spatially and temporally – by virtue of the underlying notion of
interconnectedness that both conditions and is conditioned by globalization,
rendering our ability to distinguish boundaries as a problematic feet. The notion of a
“boundary” is compromised spatially – as spheres of influence and direct/indirect
impact extend beyond defined geographical borders – and temporally – as cause and
effect schemes complicate over time. The complication posed by the necessity of
setting boundaries rests precisely in this point, as impacts (and their “weight”) alter
over time in terms of both quantity and quality, varying between distinctions of
direct and indirect, short-‐term and long-‐term, and acute and obtuse. Thus, as no
entity can be considered (nor assessed) in a vacuum, we must consider the role of
externalities while mapping the scope and boundaries of assessment. Building upon
GRI “Boundary Protocol,” SAFA addresses how “activities involving a complex
network of value chain actors, sometimes around the globe, affect and are affected
by an organization’s social, environmental, and economic performance” (28). Thus,
mapping requires an exercise of inclusion and exclusion in order to understand what
should be measured, how the organizational and operational boundaries are
designed, and where interactions occur within the network of production. As noted,
the larger the company, the greater the diversity of supply chain interaction will be,
calling upon the supply chain map to address its complexity and distilling the
assessment to “material matters,” including those potential spheres of influence
and/or influence and excluding the so-‐called “cut-‐off” areas beyond relevance. In
such a way, “mapping” becomes a collection of “material matters”63—from this
63 idem, p. 31.
45
perspective; boundaries can thus be determined by those issue areas, which could
significantly influence the assessment of the intended report user.
PROCEDURAL ISSUE: HOW TO MAP SMALL-‐SCALE PRODUCERS?
Insofar as SAFA was created for the assessment of both small-‐scale and large-‐scale
producers, the amount of resources and capacity needed for a comprehensive,
performance-‐based assessment is not often accessible to small-‐scale producers to
successfully perform a “true” SAFA assessment according to the criteria required. In
light of such limitations – that is, the lack of “material matters” – the SAFA guidelines
foresee and assuage such problematic by including a number of sub-‐themes
specifically concerning small-‐scale producers and related indicators for assessment.
In such a way, despite the potential lack of existing data, resources, and capacity, as
well as the most-‐likely irrelevance of global indicators for small-‐scale producers,
SAFA claims that its inclusion of tailored indicators (practice-‐based or target-‐based
indicators rather than strictly performance-‐based ones) allows for a small-‐scale
entity to nevertheless achieve high-‐sustainability scores. However, we shall review
the actual effectiveness of SAFA for small-‐scale producers in the following chapter,
as such provisions were laid out ex ante and, therefore, require critical ex post
review in order to determine if SAFA is indeed a tool applicable to both large and
small-‐scale entities.64
In defining the boundaries and scope of SAFA for small-‐scale producers, the
objectives underlying such actors to partake in the assessment play an important
role in defining its scope. That is, SAFA acts as an incentivizing tool for small-‐scale
growers to achieve high performance ratings and, thus, bolster their market
position. The emphasis on “sustainable development,” as paralleled in TPR schemes,
whether through self-‐evaluation or delegated assessment may be desired by the
assessed entity insofar as “being sustainable” equates to credibility, accountability,
64 Please note that another version of SAFA for small-‐scale holders is expected to be released in November 2014. However, the newly adapted version should nevertheless be assessed in terms of legitimacy, accessibility, and effectiveness for small-‐scale producers. That is, does it provide access to data—the key weakness in small-‐scale sustainability assessments—in order to correct what was not working in the original “one size fits all” version of SAFA?
46
and transparency. Such qualities incentivize, for example, local procurement or
foreign investors to invest in “sustainable” companies, as our globalized economy
and transnational regulatory processes, increasingly call upon “sustainability” as the
foundation for a smart, green, and inclusive future. Furthermore, those small-‐scale
growers who may evidence their “sustainable” practices and high performance
ratings through SAFA or alternative sustainability reporting frameworks may profit
from compensation provided by local, regional, national, or international
governments and/or organizations (e.g. Payment for Ecosystem Services and the
development of local legislation with potentially direct effects on entities). Thus, the
incentives behind sustainable development are two-‐fold: both the regulator and
regulated, theoretically, benefit.
STEP #2: CONTEXTUALIZING DATA AND OUR READING OF IT
The necessity to adapt the small-‐scale producer to SAFA by virtue of assessing the
relevance of sub-‐themes and indicators, leads us to the second procedural step in
SAFA—that of “contextualization.” In order to refine the measurements and ratings
of a given assessment, the specific context of each entity must be considered; that is,
the geographical, social, and political background – in addition to capacity for the
entity to access data resources for the assessment itself -‐ must be considered. By
means of contextualization, the apt set of sustainability themes, sub-‐themes and
indicators can be determined in light of their relevance, consequently providing for
the most appropriate sustainability ratings. The main procedures in the
contextualization step involve the review of sub themes (based on their stated
boundaries and objectives) and the review of default indicators (based on their
relevance to the sub-‐themes selected). The act of contextualization contrasts the
“one-‐size-‐fits-‐all” approach of sustainability reporting, allowing for each assessment
to consider the pressing issues underlying a specific entity’s circumstance (e.g.
human rights violations, soil degradation, water scarcity in the area). If a certain sub-‐
theme is not considered relevant in determining the overall rating, SAFA requires the
elimination to be justified in the final Performance Report.
47
The act of reviewing default indicators involves a comprehensive collection of data in
order to “contextualize” the findings/ratings for each sub-‐theme. However, some
small enterprises may be faced with the dilemma of limited data and/or the lack of
resources to obtain data in the attempt to contextualize a default indicator, thus
prompting the entity to seek an alternative method for comparing data (for example,
GHG emissions or biodiversity) in relation to other entities in a similar or comparable
“context.” Although, in this hypothesis, GHG emissions data is not feasible for the
entity to obtain, the assessor may decide to compare practice-‐based indicators as a
proxy for performance—whether it performs better or worse in the given context.
Consequently, in the same way performance-‐based indicators yield reportable
results, the assessor can determine numerical values for its overall scoring on the
practice-‐based proxies, which is then visually expressed as ranging between degrees
of red (i.e. “unacceptable”) and dark green (i.e. “best”). Following such logic, the lack
of primary data does not disable sustainability reporting under the functionality of
contextualization in SAFA. Instead, the SAFA tool’s flexibility addresses such issue of
rigidity, favoring the tool’s accessibility.
Lastly, SAFA also provides for the possibility of creating “contextualized indicators,”
in which the assessor may not find the default indicator as fit. In such case, the
assessed enterprise must comprise a customized metric system for determining the
sustainability score in relation to a relevant sub-‐theme. In such a way, SAFA’s
“holistic approach” enables the assessor to identify “critical areas, based on
materiality principles of the context of that entity” (43), extending sustainability
reporting beyond the stock guidelines.
STEP #3: SELECTING APPROPRIATE TOOLS AND INDICATORS
Building upon the contextualized sub-‐themes from the previous step, the selection
of relevant tools and indicators, further distills the assessment of the sustainability
themes as a means of approaching a more “correct” score. The SAFA guidelines
define “tools” as the variety of commonly used measurement systems or assessment
48
techniques for different sustainability topics.65 The selection of the tools used in
assessment should be based on default indicators (or their substitute), the
availability of data on an entity’s performance, the budget for the assessment, and
the required level of compliance review. The selected tools or standards for data
collection and metrics for measurement should be included in the SAFA Performance
Report alongside the Accuracy Score.
As the type of data and method of data collection may vary among assessors, SAFA
codifies two different typologies: primary data (data collected, measured, calculated,
or estimated from the site associated with the unit processes within the system
boundary)66 or secondary data (data derived from an alternative source).67 In some
cases, existing databases or commonly used tools, metrics, and standards may
facilitate the data collection process, integrating already-‐existing data, avoiding the
duplication of results, and facilitating assessment. The Appendix A therein contains a
list of partner initiatives for existing data sources, which allow the SAFA framework
to serve as an “umbrella-‐like” structure, incorporating existing data reporting (such
as CSR audit reports or GHG Protocol Corporate Accounting and Reporting
Standards) while creating a common language and system of equivalencies in both
the data collection and reporting processes. The possibility of using/accessing
secondary data enables small-‐sized entities to reduce overall assessment burdens
and efficiently appraise its sustainability in order to identify potential “hot spots”
and implement improvement measures. Although primary data is certainly more
accurate, it is also more costly. Therefore, a tradeoff may occur as the user is offered
the choice to incorporate strictly primary data, secondary data, or a mix of the two,
as a means of reducing cumbersome data collection and fostering efficiency, while
nevertheless maintaining quality.
65 SAFA (2013), p. 45. 66 Based on the definition of primary data as found in ISO14044: 2006, which SAFA endorses. 67 idem.
49
Figure 4: Assessing data quality in SAFA
The notion of quality becomes prominent in the calculation of the Accuracy Score,
which rests upon the quality of data collected and the type of indicator used.
The quality of data, as set forth by SAFA, can be categorized “high quality” if primary
data is collected for the prime purpose of the SAFA assessment (and, therefore,
geared towards the 4 overarching sustainability dimensions), or if primary data
collected for another sustainability tool; “moderate quality” refers to secondary data
used as a proxy to make an informed assumption regarding enterprise performance;
“low quality” describes estimates made based on general information that are not of
primary or secondary data nature. Hence, depending on the time frame (i.e. when
the data was collected), the quality of data, and the methodology of data collection
(i.e. primary or secondary), the Accuracy Score can then be calculated for the final
SAFA Performance Report, which is attained by averaging the sub-‐theme, theme,
and overall levels.
SUSTAINABILY PERFORMANCE
In order to determine the level of sustainability performance, an entity should use
the “indicators” as measurements or assessments that provide evidence as to
whether or not a certain condition exists to calculate the Performance Report. The
default indicators were established according to what experts consider to be the
most important components of a sub-‐theme to be measured in order to assess
sustainability at that level. In such a way, default indicators provide standardized
metrics or guidelines for the future of sustainability assessment. SAFA provides for
the rating scale framework of the default indicators (i.e. the range between
50
“unacceptable” and “best” levels of sustainability performance), yet it does not
provide for the full rating scale. The full rating scale, hence, must be “contextualized”
by the assessor itself in light of the entity’s sub-‐theme boundaries and indicator
thresholds established.
Determining the effectiveness, or “weight,” of indicators serves to appraise their
accuracy by outlining thresholds in order to calculate the entity’s rating in the overall
Performance Report. The “hierarchy of indicators” (see Figure 5) indicates the
degree of quality in relation to each of the 3 types of indicators cited. That is, the
“performance-‐based indicators” are “focused on the results of compliance with an
objective and can measure the performance of an operation, identify trends, and
communicate results;”68 “practice-‐based indicators,” on the other hand, represent
process over outcome-‐oriented indicators, referring to compliance with certain
procedural requirements and, therefore, serve as proxies for “good” practice;
“target-‐based indicators” represent those which focus on the planning, policies or
monitoring of targets, where ratings depend upon the degree of progress vis-‐à-‐vis
implementation.
Figure 5: Hierarch of Indicators
68 SAFA (2013), p. 56.
51
The necessity of determining the thresholds – that is, the flexibility of sustainability
reporting in light of diverse contexts – directly correlates to the SAFA score, as a
“fair” and “balanced” assessment. In order to account for the varying “weight” of
indicators, ranging between the most accountable and accurate (performance-‐
based) to less accurate yet sound estimates (practice-‐based), allows SAFA to
differentiate among indicator types and allot varying degrees of “weight” to them. In
order to assure that all sub-‐themes are weighted equally (and effectively), the
weight must be distributed evenly among indicators within each sub-‐theme, for each
dimension. For example, the use of multiple indicators maximizes the potential
scoring; that is, the maximum potential score of 3 indicators related to a single sub-‐
theme allows for a “best” or dark-‐green score (i.e. the highest mark), On the other
hand, the use of only 1 indicator per sub-‐theme would not require any weighing
apparatus, which could potentially lead to limited assessment of the overall
sustainability dimension. The guidelines comprehensively detail how the SAFA
Performance Report should be drafted, located in the appendixes therein, enabling
flexibility by virtue of the variety of indicators and accounting for such differences by
virtue of aggregated weighing systems (see Figure 6).
Figure 6: Rating scale and weights of indicators
Thus, the calculation of multiple indicators and types in the final Performance Report
evidences the holistic approach proposed, presenting the SAFA tool as a landmark
advance in sustainability reporting, capable of addressing tradeoffs and
52
equivalencies; externalities and internal gaps; synergies and impacts; a tool for
improvement.
STEP #4: REPORTING PERFORMANCE IN A “SNAPSHOT”
The final step comprises a visual representation of the SAFA results, a complete SAFA
Performance Report, the disclosure of the procedure, and a critical review. The
“visual representation” protocol describes the attempt to condense the mass of data
collected for each of the 21 sustainability themes, 58 sub-‐themes, and hundreds of
related indicators into a single “snapshot,” visualizing the overall sustainability
performance and sustainability gaps in a graphic: a polygon. This process of data
visualization allows for an entity to “read” the conceptual framework and the
interconnected relations between them by identifying all 21 sustainability themes
around the circumference of the SAFA polygon. The degree of color disseminating
from the core establishes varying degree of performance – along the diameter
stemming from the nucleus – between “unacceptable” (at the core) to “best” (in the
outer ring). In such a way, a think black line connects themes, visualizing
performance by virtue of the user-‐friendly, color-‐coded ranges (dark
green/green/orange/yellow/red) as a means of depicting where performance is
good,
Figure 7: SAFA Polygon
53
The final report comprises an overall synthesis of the assessment, detailing the
scope, boundary setting, qualified themes’ ratings (and Accuracy Report), hotspot
issues, irrelevant sub-‐themes (and the justification of their omission), and areas to
be improved. The disclosure of the procedure used in the assessment speaks to the
accuracy of it, requiring a transparent documentation of the procedure and
methodology used. Such disclosure encompasses the selection of system
boundaries, indicators, and thresholds determined, the data sources (primary or
secondary), and the incorporation of any assumptions regarding the entity’s
performance. Such transparency calls upon the assessor to identify areas where lack
of data or resources were encountered, as well as its justification for excluding
certain sub-‐themes or the use of alternative contextualized indicators instead. The
Accuracy Report can serve as a form of self-‐evaluation for mere internal use or for
external sustainability reporting. In the latter, the transparency of all methodologies
– the boundary, threshold, and indicator settings – “accounts” for the
accountability” of the assessment itself.
Mirroring the procedural outlines of already existent sustainability reporting (e.g.
LCA, GRI, ISEAL Alliance), the act of critically reviewing compliance and indicator
selection/performance by a third party renders it ultimately more tangible, practical,
and “real.” That is, a critical review promotes the quality, credibility, and
transparency of the assessment, identifying areas of sustainable performance and
others in need of improvement, staking the SAFA tool as an instrument of and for
socio-‐political, economic, environmental, and regulatory transformation(s).
54
CHAPTER #3: TESTING SAFA VIS-‐À-‐VIS THE PILOT PHASE
The current version 3.0 of the SAFA Guidelines described above represents an
attempt to consolidate a notion of sustainability within a single, coherent
framework, building upon 3 years of interactive collaboration among public and
private actors in the effort to “distill” the meaning of sustainability, specifically, in
relation to the food and agricultural sectors. As multiple frameworks, standards,
indicators, and technical protocols for sustainable production, manufacturing, and
retailing of food and agricultural products exist, the SAFA initiative of 2009 marks the
first international attempt to harmonize such standards—that is, to construct a
conceptual framework of and for measuring sustainability.
The SAFA Guidelines today, hence, stem from the evolution of negotiations, cross-‐
comparisons, and collaboration among a variety of stakeholders, NGOs, public
institutions, corporate enterprises, and scientific institutions, accounting for the
publication of its first version – the Test Version 1.0 of the SAFA Guidelines – in June
2012. In order to “test” the applicability, usefulness, acceptance, and technically
soundness of the instrument, the FAO held a “Pilot Phase” between October 2012
and February 2013. Insofar as the guidelines are built upon the cross-‐comparison of
codes of practice, corporate reporting, standards, and indicators used by enterprises
and organizations that implement sustainability tools, the participants in the SAFA
trial constituted, then, a group of those very actors as a way for SAFA to “test” itself.
In the course of conducting research on the current SAFA tool, I was kindly granted
the opportunity to access the pilot studies, which allowed me to assess the
methodologies used, sub-‐themes and indicators selected, completeness (or
incompleteness) of the tool, degree of compliance (or incompliance), and method of
data collection and aggregation employed. Consequently, the present chapter shall
discuss the overall successes and shortcomings of the tool, as identified both within
the pilots themselves and by virtue of my observations herein. Noting recurring
obstacles and trends within the pilots, I shall set forth a qualitative assessment of the
failures and virtues, gaps and overlaps, potential areas of improvement and overall
55
effectiveness –and, consequential, utility – of the given instrument in assessing
sustainability.
SAFA “LIVE” FROM THEORY TO PRACTICE
As a tool representing an aim to create a collective understanding of the constitutive
elements of sustainability and a common framework for the assessment of it, in
theory, the pilot studies served to uncover the issues at stake, in practice. Such
studies encompass 23 small, medium and large-‐enterprises from 19 different
countries (i.e. Bangladesh, Bolivia, Canada, Costa Rica, Dominican Republic,
Germany, Ireland, Italy, Nepal, New Zealand, Peru, Sao Tomé et Principe, Spain,
Switzerland, Tanzania, Thailand, UK, and USA), spanning across various sectors,
including crops, livestock, forestry, fisheries, wild harvests, cotton, bioenergy,
tobacco, and peat moss commodities and value chains. Some pilots covered a variety
of processes from primary production to packaging, whereas others focused on a
single site or step in the given value chain, depending on the stated scope and
boundaries of the entity’s assessment.
The overall consensus among participants speaks to the tool as a useful instrument
for self-‐evaluation and internal monitoring as a means of identifying sustainability
“hot-‐spots,” in turn allowing for future improvement strategies. Other
commonalities found within the SAFA Performance Report and the feedback
sections underline the SAFA approach as comprehensive (i.e. extensive themes and
sub-‐themes), systemic (i.e. the mapping of an ecological-‐social-‐economic-‐
governance resilience network), and useful for internal sustainability evaluation (e.g.
the polygon enables a simple visualization of sustainability issue areas).
Despite the concurring attitude towards SAFA’s effort to create an overarching
framework for sustainability as a positive step forward, the majority of pilots (with
the exception of one) stressed multiple challenges encountered, highlighting
structural, operational, and conceptual “gaps” in the (then) current Test Version and
offering suggestions for its improvement in light of the exercise. In order to function
as both an effective and inclusive instrument, the tool would have to consider those
56
limitations and integrate their solutions into a new framework. The most prominent
challenges, which surfaced during the pilot phase expose the following
shortcomings: ineffective compliance checks, indicator challenges (boundaries,
thresholds, and types), inappropriate metrics, the issue of data collection, the
implications for small-‐holders, and the future utility of SAFA. Hence, we shall not
only identify the issues presented during the pilot phase but also inquire as to if and
how the writers of SAFA redressed the underpinning issues at stake in the current
version of the tool.69
SAFA “HOT-‐SPOT” #1: INEFFECTIVE COMPLIANCE CHECKS
As an umbrella framework, SAFA includes compliance checks in order to incorporate
existing sustainability schemes as a means of facilitating assessments by virtue of
benchmarking, consequentially avoiding the “double work” entailed in data
collection and sustainability reporting. This step (step 2) comprises 2 parts, in which
the entity must first determine if a sub-‐theme is “relevant” and, consequently, cite
the means by which it “complies” to the sub-‐theme indicator (e.g. listing the
alternative certification or standards scheme to which it adheres). However, such
relevance questions were often criticized as being “very general” and “not specific to
our area of concern” (as stated by a tobacco producer in Bangladesh), presenting the
entity with the choice of eliminating the sub-‐theme on the basis of irrelevance. As a
result, the following SAFA assessment could, potentially, not cover significant
sustainability dimensions by virtue of the justifiable or unjustifiable decision on
behalf of the assessor – a subjective decision – rendering the notion of relevance
immediately problematic. In the above case, despite the possibility of providing
contextualized indicators for compliance, the entity failed to do so, evidencing the
embedded “option” within the SAFA framework to construct alternative indicators
for compliance shall the entity have both the temporal and motivational conditions
to carry out a comprehensive analysis. In such a way, the relevance and compliance
69 One should note that the original Test Version 1.0 of the SAFA Guidelines comprised 6 steps: 1) Goals and Scope; 2) Relevance and Compliance Check; 3-‐4) Indicator Selection and Data Collection; 5a) Aggregation of results of SAFA themes and sub-‐themes; 5b) Visualization of SAFA results; and 6) SAFA Performance Report. The exercise also included a final “Feedback” section.
57
check may exclude sub-‐themes and indicators, which could potentially expose crucial
hot-‐spots and, in SAFA’s test version, run the risk of being overlooked.
Secondly, the notion of “compliance,” or benchmarking, reveals further obstacles, as
pilot feedbacks speak to how “existing schemes do not comply with indicator
structures,” how “some areas are not covered” in existing sustainability frameworks
(e.g. FSC does not include GHG balances), the difficulty of aggregating LCA data, and
the discrepancy between certification schemes, as not all sustainable practices are
certified (e.g. an Argentinean producer of organic pears and apples is “not always
organic,” resulting in a poor, skewed score under the “Agricultural Biodiversity” SAFA
indicator). In such a way, compliance based on un-‐guided benchmarking exposes
ineffectiveness, as the SAFA framework assumes optimal performance of the sub-‐
theme. However, differences in approach, scope, and scoring do not allow for
ultimate equivalency. A Quebec-‐based pork producer ultimately suggests, the
solution here is to “remove [the] compliance step.”
Figure 8: Entity compliance with SAFA sustainability guidelines
The problematic of compliance cannot only be located within the structural makeup
of the tool but also within the operational aspect; that is, how the entity performs
the compliance check. Reviewing the pilots, the majority of entities failed to follow
0
5
10
15
20
25
Step 1 Step 2 Step 3 Step 4 Step 5 Step 6
# of Completed Pilots per Step
Step 1
Step 2
Step 3
Step 4
Step 5
Step 6
58
the SAFA guidelines in its compilation of step 2, such as selecting “yes” to
compliance without providing the relevant standards (e.g. all 3 pilots performed by a
New Zealand Agribusiness on its organic dairy farms, wineries, and the Ngai tahu
Maori indigenous food programme selected all sub-‐themes as relevant despite many
of them being inaccessible). In addition to inappropriate selection and,
consequentially, distorted scoring, at times the sub-‐themes were considered
“relevant” in step 2, yet “inactive” in step 3-‐4, such as the sub-‐theme “Food Safety”
for the above-‐mentioned enterprise’s assessment of its wine production. In many
cases, such inconsistencies either led to the “best” score, wrongfully, or eliminated
rightful areas of assessment.
Furthermore, the use of compliance checks may create a structural bias towards
large companies, which have the capacity for adhering to multiple schemes of
sustainability. As an organic small-‐grains producer in North Carolina describes:
relevance check questions are poorly worded and exclude many themes that are relevant to all. Only clearly irrelevant sub-‐themes should be excluded. Compliance check will not capture true performance of the farm and will bias safa toward large users who hold many certifications or claims, regardless of the value or rigor of these claims. (Performance Report)
Insofar as the level of verification and degree of compliance among so-‐called
“compliant” alternative schemes are variable, no objective standard or common
ground for sustainability exists, potentially leading to the opposite and least
desirable result: incompliance. As the compliance check renders the value of SAFA as
a rigorous instrument for assessing sustainability questionable here, the SAFA
working group, which convened post-‐pilot phase, addressed this matter by
suggesting to “integrate the results from existing sustainability tools and certification
schemes into SAFA,”70 serving to maintain the integrity of the SAFA tool.
70 SAFA: A Long-‐awaited Step Forward to Understanding and Acting Upon Sustainability in the Food and Agriculture Sectors, 2013. Web. 14 November 2014. (available at www.sciforum.net/conference/wsf2/paper/959/download/pdf).
59
However, one may ask if this integration occurred and, if so, to what extent. The risk
of integrating unequal benchmarks and mis-‐allocating sustainability rigor comes into
question again, which should be explored within the new SAFA Version 3.0. That is,
were different schemes rated, scored, aggregated, prioritized, and/or disregarded?
How are the necessary boundaries of integration framed in the proposed solution?
And, most importantly, does the new SAFA solve the problem of benchmarking by a
selective approach towards integration? As the stated goal of SAFA is “improved
accuracy of analysis of sustainability for all users,”71 one may ask how the new SAFA
differs from the prior framework. In response to such challenge, the Appendix A of
SAFA Version 3.0 – “Selected Sustainability Tools” – provides for the scope of
selected tools as compared to the SAFA landscape, listing a variety of sustainability
tools (e.g. Footprint Calculators, LCA, VSS, GRI, etc.) and allocating compliance
against the “Steps of the Value Chain Impacts Covered” (i.e. production, processing,
and retail) and the “Sustainability Dimensions Covered” (i.e. environment, economy,
governance, and social). In such a way, the current version of SAFA evidences a more
selective approach to identifying the benchmarks for compliance.
SAFA “HOT-‐SPOT” #2: INDICATOR CHALLENGES
SETTING BOUNDARIES: SPATIAL AND TEMPORAL SPHERES OF INCLUENCE
The function of indicators to serve as metrics for actual performance – whether
appropriate or not to given sub-‐sectors and, consequentially, their utility – marks a
major shortcoming in the pilot SAFA framework. Although step 1 calls for the
identification of setting the material and spatial system boundaries of the SAFA
assessment, such process lacks any sense of guidance therein. That is, when
addressed with the question, “Which is the entity’s sphere of influence?” many pilot
studies struggled with the notion of distinguishing between processes along the
value chain in question, which is a fundamental step foreseen by SAFA protocol in
order to carry out a rigorous and scientifically sound impact assessment of entity
performance. As one of the pilot entity’s reported (a processor of organic fruits and
cereals), the assessor was “not sure where the sphere of influence ends.” Unable to
71 Idem.
60
determine the entity’s “sphere of influence” as allocated along the supply chain,
within the company as a whole, or at a certain processing level, they posed the
question, “Is it a farmer or company being assessed?” In such a way, the lack of
guidance leads to an unclear understanding of setting the very scope of assessment
itself, rendering the ultimate sustainability report as a misguided one. Similarly, as an
organic and fair trade coffee producer in Tanzania and Mexico pointed out, the
necessity to differentiate among each element of the supply chain posed a
“conundrum” to the assessor insofar as the spheres of impact allocation within each
step of the supply chain continually shift.
As a result, the necessity to “draw lines” in absence of SAFA guidance lends itself to
multiple interpretations of what defines operational boundaries. Failing to set
minimum requirements for these boundaries revealed a critical gap in SAFA’s ability
to capture negative externalities—and, therefore, assess their impact(s). As
described by the coffee producer in regards to GHG balances, “specific boundaries
are needed for this indicator,” underlining how “with carbon foot printing,
boundaries make all the difference.” Thus, the inability to capture externalities –
whether positive or negative – disables the tool’s efficiency, failing to account for
what lies at stake in sustainability assessment; that is, impacts and implications.
In a similar way, the issue of temporal boundaries emerged within the pilot studies
as well. As SAFA’s default temporal coverage is that of 1 year (with the potential
extension to 5 years for certain indicators), the attempt to assess ecological
processes however exposed another limitation to boundary-‐setting. That is, the
analysis of many ecological processes extends beyond the 5-‐year boundary (e.g.
forests, wetlands, and grasslands), complicating temporal dimensionality where
impacts cannot be seen for up to 20 years in some cases. In such a way, the arbitrary
setting of boundaries here vis-‐à-‐vis short ecosystem timelines risks to either
overlook potentially detrimental environmental impacts (e.g. land degradation and
biodiversity reduction) and/or fail to acknowledge long-‐term sustainability efforts
(e.g. GHG mitigation). As the Feedback Report from a New Zealand Agribusiness
mentions:
61
The 5-‐year stand down threshold period for production from natural habitat is very short indeed! The downstream ecological consequences are expressed in several decades. This may prevent use of SAFA to discourage habitat destruction (was that your intent), the SAFA releases the producers from sustainability score penalties after very little time, so consequences for the producer are very temporary.
Insofar as the SAFA settings limit ecological processes to low downstream
thresholds, the ultimate sustainability score cannot, de facto, be truly representative
of the actual situation, in turn failing to punish unsustainable practices, exempting
producers from maintaining sustainable performance, and ultimately undermining
the very goal of SAFA itself.
In response to such loophole, the SAFA working group proposed to make certain
indicators “measurable,” suggesting for the use of assessment kits for quantifying
environmental indicators, including reference to existing resources.72 Although the
current SAFA Version 3.0 does not include such “assessment kits,” the re-‐working of
indicators (now step 3) integrates metrics for data collection and dedicates a section
to the “Rating of sub-‐themes for [the] environmental dimension,” which differs from
the other 3 fundamental sustainability dimensions. The treatment of indicators
under the environmental dimension incorporates multiple indicators and indicator
types (whether performance, practice, or target-‐based), allowing for a more
comprehensive approach towards data collection and indicator selection. Lastly, the
omission of certain environmental indicators is allowed on the basis of justified
reasoning (if small-‐scale producers, for example, cannot access necessary data). At
the same time, sanctioning mechanisms (i.e. negative scores) are implemented for
cases, in which “regular enterprises omit relevant performance indicators” (63). Such
modifications speak to the evolution of the SAFA tool in the realm of environmental
indicator mapping; when boundaries are narrowed, SAFA aims to maintain its
integrity by virtue of transparent reporting.
72 idem.
62
BETWEEN FLEXIBILITY AND RIGIDITY: INDICATORS IN SEARCH OF A THRESHOLD
As SAFA seeks to guarantee a fair playing ground for all types of enterprises across
regions and sectors, such aim renders the delineation of thresholds problematic.
That is, flexibility offers the ability to address varied and multiple settings, whereas
rigidity seeks to uphold the rigor of SAFA sustainability reporting. Consequently, the
guidelines must find a balance between these two extremes, which represents a
common issue reported in the pilot studies. For example, a smallholder tobacco
producer in Brazil and Bangladesh called for “indicators to be minimized and
précised,” as a peat moss producer in Canada argued:
While it is relevant and essential to define strictly themes and sub-‐themes to assure a common understanding of the issues to be assessed, the proposed indicators are currently too specific and hence, too restrictive to cope with the different activities / practices / behaviours the organisation might have (step 3-‐4, column F). Similarly, proposed questions are often inappropriate (depending of the size of the organisation) or too specific to provide the flexibility needed to document the organisation's behaviour/pratice. We spent a lot of time trying to adjust the indicators/questions to fit with the existing information -‐ which however was both sufficient and relevant to cover the themes and sub-‐themes. More flexibility is hence needed at the "indicator" and "question" levels. (Step 6)
Hence, an observed tradeoff occurs between the general and the specific, in which
the issue of threshold subjectivity oscillates indicators between the realm of the rigid
(and, therefore, irrelevant or exclusive) and the realm of the flexible (and, therefore,
prone to unregulated openness). Furthering its argument, the peat moss producer
underlined how “most issues are common to all,” whether upstream or downstream
and among sectors (livestock, grain production, etc.), yet some are “specific and
should be treated as such.” Supporting such argument, the entity stressed how its
main activity – wetland management – is not covered by the SAFA indicators as is
and, therefore, recommends “sector specificities.”
The pilots reveal the failure to capture the full picture of sustainability despite the
comprehensive scope of SAFA themes and sub-‐themes, calling upon the need for a
core list of required indicators accompanied by precise guidelines for the
63
customization of sector-‐specific/region-‐specific indicators at different steps along
the value chain or level of production. Although the current version of SAFA provides
for review and contextualization of all 58 sustainability sub-‐themes and relative
default indicators, does this resolve the issue at hand? That is, although the
“contextualization” (current step 2) and “selecting tools and indicators” (current step
3) do provide, once again, for the possibility of contextualizing indicators according
to entity-‐specific operations, a suggested database for regional and sectoral-‐specific
data has yet to be created. Given such, are we not stuck in the same situation as
before? Is the new version of SAFA so different from its original form in this respect?
Although the guidelines provide for a broader explanation of default indicators, in
light of the fact that SAFA users may not have the technical knowledge to develop
the indicators themselves (e.g. a small-‐scale producer’s lack of capacity or
resources), the critical element – data – is not made available via a shared database
for contextualizing sectoral/regional/value-‐chain specific indicators. Hence, the
problem persists in the form of the unknown: user subjectivity, varied degrees of
expertise, and unregulated capture of the sustainability scope. Stuck, then, in the
same trappings, the current SAFA guidelines note that “expert knowledge and/or
technical expertise” (34) will most likely be required in order to properly
contextualize indicators, aiming towards, on one hand, a scientifically sound “use” of
the tool yet simultaneously undermining its stated claim to be a tool for all kinds of
potential users.
BLURRED THRESHOLDS: WHAT TO DO WITH THE SPACE-‐IN-‐BETWEEN
The implications behind the ambiguity of thresholds renders the SAFA tool
vulnerable, as indicators become the subject of interpretation and – in the case of
contextualization – subjective constructions(s). In order to analyze the customization
step in practice, the peat moss pilot exemplifies a case, in which customized
indicators were constructed for the environmental dimension, namely Air Pollution,
Water Quality Eco-‐Efficiency, and Waste Reduction and Disposal. For the given sub-‐
themes, the test version default questions for each were quantitative in nature; for
example, regarding waste reduction, the indicator posed the following question: “By
what percentage has the total amount of waste decreased over the last five years?”
64
The ability to customize such indicator however – whether appropriate or not – is
not conditioned by SAFA guidelines on how that customization should take place.
The entity’s tendency, then, to substitute the percentage-‐based questions for each
of the above sub-‐themes with ex ante indicators (i.e. whether the entity has put
actions plans for future mitigation or improvement in place), ultimately undermines
the aim of SAFA to assess an entity’s sustainability over a certain time period. In such
a way, the entity shifted its sustainability scoring from one based on performance
over time (in the past) to another based on improvement measures for the future,
thus modifying the underlying question at hand and potentially altering its score for
the better. Although the functionality of “customized indicators” is an important one
within the SAFA framework in the name of contextualization, the pilot studies reveal
the ever more important need for guidance in that process.
Similarly, other pilots evidenced blurred regulatory distinctions—that is, the
knowhow of where to draw the line. The feedback from a tobacco leaf producer in
Brazil sheds light onto this issue in reference to the sub-‐theme Community
Investment, in which the indicator poses the question, “What share of revenue is
invested into the maintenance or rehabilitation of common goods like soil, water or
forests at a community level?” Scoring below 20%, the entity challenged whether
this indicator should be based on a benchmarking scheme or market-‐based
reference point insofar as “20% of revenues invested in sustainability looks not that
bad, but is considered poor rate (red). What's the reference for what can be good or
bad?” Posing such question (rightfully so), the feedback underscores the implications
of word choice; default questions, hence, must consider what is implied by such
wording, as default indicators act as mediums by which scoring takes place. Another
example can be found in a New Zealand winery pilot regarding the sub-‐theme
Waste, in which the entity claims to fulfill the “best available technology” standard
by indicating how “very little waste” is produced, their practice being “landfill rather
than burn.” However, one may question whether a landfill should really represent an
appropriate benchmark for high sustainability performance within a framework such
as SAFA. Such example, again, points the necessity of re-‐considering threshold
65
design, which depending on the rigor and ultimate intended use of the SAFA may still
constitute an area of concern in the current version of SAFA today.
INDICATOR TYPES
Another common suggestion found among pilot participants was that of creating a
variety of indicator types, as conditioned by the size, scale, capacity, and resources
of the assessed entity and its ability to report upon performance-‐based indicators.
As the SAFA tool is designed to assess the performance of enterprises, many of those
enterprises – represented by virtue of the pilot as proxies – attested to the difficulty
of obtaining measurements for performance-‐based indicators. Examples include the
suggestion to use “practice” over performance scores since “more suitable at the
environmental level,” (a Canadian peat moss producer), the argument for “best
practices at times” (a pork producer in Quebec), the proposal for “more practice-‐
based indicators” (a small-‐scale fishery in the US), and the use of “general
knowledge” over performance-‐based indicators (an organic dairy farm in New
Zealand). In the attempt, then, to maintain the performance-‐based SAFA approach,
the post-‐pilot working group suggested to include a hierarchy of different indicator
types and to create an aggregated weighing system dependent on the type of
indicator selected. Following up on such recommendations, the current SAFA
Guidelines do indeed incorporate an indicator pyramid, detailing a comprehensive
scoring system based on the “type of data” (high, medium, or low quality) and type
of indicator used (performance, practice, or target-‐based). The current hierarchy and
aggregation of scores based on 5 (rather than the previous 4) sustainability
performance levels evidence the broadened structure of the current SAFA
guidelines, acknowledging the network of complexities in sustainability assessment.
SAFA “HOT-‐SPOT” #3: (IN)APPROPERIATE METRICS
The complexity of sustainability reporting can be observed in the intricate number of
SAFA sub-‐themes and indicators, attempting to “make sense” of what constitutes
sustainable performance. The appropriateness of the indicators, therefore, vary on a
66
case-‐by-‐case basis depending on the entity in question—its size, location, sector, and
scope of assessment. As discussed, SAFA attempts to cope with the issue of
relevance by virtue of its “relevance and compliance check,” (step 2) which unveiled
a number of functional gaps within the SAFA framework, ultimately calling upon a
revised understanding of how “relevance” and “compliance” should be evaluated.
The pilot studies served to uncover these “holes” in the system, as default indicators
often posed problems to the assessor in measuring performance for a given sub-‐
theme. We shall investigate a few of the most common challenges found in the pilot
phase and review how the new version of SAFA deals with these challenges.
Firstly, the question of full-‐cost accounting (G5.3, a governance-‐related indicator)
explores the notion of holistic management—a key component of assessing
sustainability through good governance. However, the inability or mere failure of
many of the pilot studies to complete this indicator question reflects two areas of
concern: the first, a question of feasibility and the other, a matter of effectiveness.
The former speaks to the lack of capacity for certain entities to quantify external
impacts (as faced by a coffee producer in Tanzania and a forestry enterprise in
Bolivia). As full-‐cost accounting accounts for a fundamental dimension of measuring
sustainability practices – as impacts are externalities carrying implications – SAFA,
then, is faced with the question of how to address this issue as one appropriate for
all users, thus speaking to the latter point. The Full-‐Cost Accounting indicator still
plays a part in the new version of SAFA, as representative of an effective metric for
sustainability, calling upon a further investigation of how SAFA addresses (or
overlooks) this issue as a collective one.
Another problematic indicator discussed was that of Greenhouse Gases (E1.1), which
we shall discuss in length in the following section regarding the issue of data
collection. However, the indicator presents not only a matter of data availability but
also one of boundaries. As evidenced by the indicator description stating
“Operations contain greenhouse gas emissions as much as possible,” one may object
as to the subjectivity of such measurement. That is, to reduce GHG “as much as
possible” is highly subject to interpretation, ultimately undermining the aim of
67
rigorous sustainability reporting by virtue of slippery boundaries here. Furthermore,
the test version asks users to identify the percentage share of reduction of emissions
over the past year in order to measure performance under the sub-‐theme Air
Pollution. However, the formulation of the question suggests that expansion (mere
company development) would entail a poor score for the company even if mitigation
programmes were successful and reduced emissions proportionally overall. In other
words, a well-‐performing company could score poorly under such formulation.
Again, wording can make all the difference in indicator appropriateness and, hence,
in the correct measurement of sustainable performance.
A third dimension, which proved extremely challenging throughout the pilot studies
was that of governance—one of the 4 pillars of sustainability. In order to analyze the
governance theme, the pilot study performed by a small-‐scale fishery in North
Carolina allows us to explore a few of the issues encountered. The first sub-‐theme,
Governance Structure (G1) assesses corporate ethics and due diligence in the form
of identifying mission statements, codes of conduct, risk management policies, and
stakeholder forums. However, such prerequisites may not be relevant to small-‐scale
farmers, for example, where such formal governance structures do not exist.
Instead, many pilots called for a second indicator, which was not size-‐based (and,
therefore, not only relative to big businesses) but one based on ethics, management
plans, missions, and/or visions as proxies for governance indicators. However, one
may argue here that “Due Diligence” represents a dimension (unlike “Holistic
Management” which is more demanding for small-‐holders) that is, de facto, always
relevant to all users and, thus, the current indicator question should not be altered.
Insofar as small businesses do have stakeholders and should have plans for
informing them of decisions made or including them within the decision-‐making
process, a potential solution to the “Due Diligence” indicator would be one of mere
modification, allowing for informal agreements to attest to good governance as well.
Another problematic governance indicator for the small-‐holder represented that of
Rule of Law (G4), asking the entity to state whether “mechanisms are in place to
ensure remedy, restoration and prevention of the infringement of national and
68
international law.” Insofar, however, as the majority of large operations would have
formal mechanisms and internal audits in place, the likelihood for small-‐holders is
very small on the other hand. As the small-‐scale fishery claimed, such indicator
therefore is “unfair to weigh,” as “no formal mechanisms [are] in place.” In light of
the inclusive nature of SAFA, including small-‐scale and large-‐scale enterprises alike,
the incorporation, then, of both formal and informal mechanisms, contracts, and
agreements is crucial in order to fully assess an entity’s performance in a fair and
balanced way. The current SAFA Guidelines addresses such challenges by re-‐
structuring G1, removing “Governance Structure” and replacing it with “Corporate
Ethics.” Accordingly, “Due Diligence” sub-‐theme rightfully remains in the current
guidelines whereas the wording under G4 was modified to ask if the company “can
provide evidence” as to the existence of remedy in the case of rights infringement.
Figure 10: Good Governance Themes in SAFA
The realm of governance represents the most unattended dimensions of SAFA
overall, as the majority of pilot studies opted out of governance indicators by virtue
of compliance checks, claiming the varied sub-‐themes as “irrelevant,” which often
remained either unjustified in the SAFA Performance report or considered
inappropriate to the small-‐holder. Considering such, the often overlooked dimension
here challenges the integrity of sustainability assessment overall, as regulatory
choices require that forms of governance be effective—and, therefore, provide for
# of Pilots with "Irrelevant" Governance Themes and Sub-‐
Themes
Checked Relevant
Checked Irrelevant
69
sustainability in the long-‐run. The crucial role, then, of the governance dimension
within SAFA recalls our earlier discussion of the choice of governance structures and
the regulatory process as a fundamental relationship under transnational
governance, implicating the overall effectiveness of the regulatory scheme and,
hence, highlighting the issue at stake in SAFA.
SAFA “HOT-‐SPOT” #4: DATA COLLECTION
As previously touched upon, the issue of data collection represents another “hot-‐
spot” within the SAFA Guidelines pilot version, as different approaches lend
themselves to different results, calling upon data collection guidelines and protocols
to ensure accuracy. Such coordination between existing data and secondary data
(i.e. data collected for previously existing sustainability schemes such as LCA, GRI,
etc.) could establish common taxonomy in the field of sustainability—one of the
main goals of SAFA in its effort to create an international, umbrella-‐like framework
for sustainability reporting. However, as discussed, the new version of SAFA lacks a
shared database for benchmarking locally-‐specific practices and secondary data,
namely sectoral and regional-‐specific data needed for those entities that struggle in
the process of data collection. As a result, small-‐holders for example may find
themselves in the same compromised position as before, lacking the means and
method for data collection. The final review process of SAFA, however, if intended
for an external audience (e.g. business-‐to-‐business or business-‐to-‐consumer
communication) does foresee a review of the SAFA assessment. We may once again,
though, question as to how data collection is provided for? How are the existing
meta-‐initiatives treated? Does coordination take place? Although Annex A of Version
3.0 does provide for a more comprehensive means of benchmarking secondary data
sources and, hence, reducing the issue of false equivalence, what steps have been
taken to resolve the issue of data collection in and of itself? Insofar as the capacity
and resources for data collection have not been facilitated vis-‐à-‐vis the existence of
such database, doesn’t such absence reveal the same shortcoming as before?
The most common issues attributed to the process of data collection as reported in
the pilot studies locate either the impossibility to quantify certain sub-‐theme
70
indicators and/or the mere lack of available resources for improving its sustainability
performance. The majority of pilot studies reported the lack of capacity to provide
for the environmental dimension, such as GHG footprinting. As evidenced in the
table below, 19 out of 23 pilot case studies failed to provide data regarding GHG
calculations, as some pilots noted that a quantitative analysis would be simply “too
expensive” (a Peruvian organic fruit farmer commented) whereas others lacked
“local-‐sectoral data” on GHG measurements (ranging from forestry operations in
Bolivia, a small-‐scale fishery in North Carolina, and to a small-‐holder farmer in
Nepal). In the case of unavailable data, entities can either omit the indicator (and,
therefore, not be penalized if such omission is justified) or “customize” the indicator
according to an alternative method of measuring the GHG balance, such as in the
case of a Vancouver urban enterprise with 3 farming operations. The pilot exerciese
exemplifies a strategic attempt to bypass the challenge of GHG measurement due to
the rigidity of the default question. Given the significant cost and time-‐consuming
nature of collecting primary environmental data (such as GHG inventories, soil tests,
biodiversity assessments, water testing, etc.), the entity developed a Rapid Impact
Assessment Matrix (RIAM) tool. Its function represents that of providing a company
with a better idea of how specific urban farming activities directly impact the
environment, allowing a company with more than one operation to break them
down into their smaller components. The RIAM presented 2 main questions:
(1) which specific farm activities have the greatest environmental impacts (negative, positive, neutral, unknown) and (2) what environmental areas should a specific farm prioritize to improve the sustainability of the enterprise? (ENV-‐RIAM)
The entity customized an alternative method for calculating problematic indicators,
such as GHG footprinting in this case, in a way that allowed for data collection and
comparison as a means for assessing its sustainability performance.
71
Figure 9: Availability of GHG Footprinting Data
Following the complexity of GHG indicators lays that of Energy Supply and,
specifically, the issue of non-‐renewable resources. Given the unequal distribution
and allocation of non-‐renewable resources, many entities could not perform well in
regards to such indicator. As the Bolivian forestry and sawmill operator reported,
“fossil fuels have no substitute in this area,” resulting in a poor/red SAFA score
(although the entity should have omitted the indicator in this case). Similarly, a
small-‐holder farmer in India stated, “no other options” were available. The inability
to achieve the highest level of sustainable performance vis-‐à-‐vis best practices,
often, may be due to the geographical (and underlying socio-‐political) conditions,
conditioning any given entity in space and time. Other challenging indicators include
Traceability, as “no publically available product information” was available for the
Brazilian tobacco smallholder, as well as the lack of sectoral data on employment in
the quantification of the indicator Living Wage for a coffee producer in Mexico.
Other pilot studies simply “guessed” no toxic inputs were used in their winery
operations despite having “no data.” That said, both the lack of data required to
complete the SAFA, as well as the lack of resources, such regionally and sectorally-‐
specific issues speak to the underlying need for flexible indicators and customization,
as well as stress the urgency for a shared database. However, as previously
discussed, such flexibility must be accompanied with specific guidelines in order to
Availability/SufAiciency of GHG Balance Data
Unavailable/InsufCicient
Available/SufCicient
72
upkeep the integrity of SAFA reporting in the wake of subjectivity, while broadening
options for those entities with less resources and capacity for data collection.
SAFA “HOT-‐SPOT” #5: IMPLICATIONS FOR SMALL-‐HOLDERS
Challenging the “inclusive” scope of the SAFA aim to function as a tool applicable to
both large and small-‐scale enterprises, the pilot SAFA version failed to cover the
rights and issues of the small-‐holder insofar as the scope of proposed indicators
were often deemed inapplicable or irrelevant. Additionally, the implied costs and
resources required to attain the data for the assessment itself often exceeded their
capacity, ultimately disconnecting the user from the tool. As an organic and fair
trade cocoa producer in São Tomé and Príncipe noted, many indicators were
“irrelevant for small-‐scale operations,” or “not useful at a farmer level,” according to
a small-‐holder tobacco producer in Brazil.
The concurred difficulty to perform SAFA at a “farmer-‐level” exposed the
incoherency within its claim that a sustainability assessment can be carried out at
the “site of action.”73 In response to such inadequacy, an aquaculture processer of
trout developed its own set of “appropriate” questions in the form of a
questionnaire to be used “on the field.” In such a way, the entity re-‐worked the sub-‐
themes and indicator questions found within SAFA in reference to the local and
sectoral indicators for sustainability reporting. However, if all small-‐holders were
required to undergo such an extensive (and potentially expertise required) task, the
utility of the SAFA tool for the small-‐holder, in effect, dramatically reduces.
Another common shortcoming encountered by small-‐scale enterprises was the lack
of regional-‐sectoral data, representing an overall “gap” within the SAFA framework
for potential users. As a small-‐scale tobacco producer in Brazil noted that “no
publically available product information” for traceability indicators were available,
others reiterated the lack of local data for quantitative monitoring and reporting on
GHG indicators. Others, such as a Swiss meat processing company, simply omitted
73 A statement attributed to a New Zealand based organic dairy farmer.
73
the Land indicator as the “entity is too small to deal with [the] topic.” The lack of
local-‐sectoral data parallels the lack of resources available for the small-‐holder as
well. As noted by a Canadian pork association, the farmer’s use of variety seed (a
locally GMO licensed variety) stemmed from the fact that “no non-‐GMO
conventional options are available!” In such a case, the entity would score poorly
under such dimension due to the lack of resources beyond its feasible reach,
highlighting one of the most common dilemmas for the small-‐holder in sustainability
assessments and calling upon the revision of the SAFA framework in order to
incorporate the issue of resource (and data) availability. Hence, the notion of
benchmarking represents an issue for small-‐holders insofar as an entity’s
sustainability “performance” may score poorly simply due to its inability to attain the
information or resources needed to either attest to its sustainability claims and/or
achieve higher degrees of sustainability performance. Hence, geographical
conditions – as well as the underpinning socio-‐political ones – often “condition” the
so-‐called sustainability of a small-‐scale entity under a system of metrics to which it
has no access (nor jurisdiction), ultimately penalizing it for matters beyond its
tangible control.
Thus, in light of the issues exposed – limited existing data, the inappropriateness of
proposed indicatory the lack of capacity to complete the assessment independently,
and the difficulty of the Excel worksheet interface – the SAFA working group
discussed the necessity of incorporating small-‐holders’ rights and changing the scope
of indicators in order to enable applicability. Considering the large number of small-‐
holders who partook in the pilot phase, as evidenced in Figure 11 for a mere
suggestive/representative sample, the role of the small-‐holder within SAFA cannot
be marginalized. Such leads one to ask, does the current version of SAFA address
these gaps? How does SAFA treat the use of performance-‐based and best practice
indicators in terms of the small-‐holder?
74
Figure 11: Small-‐Holder Sample of Pilot Phase
The current version of SAFA does aim to increase the applicability of the tool to
small-‐holders in a number of ways. Firstly, it defines “small-‐scale producers “by
virtue of 3 criteria conditions and restates its “aim to be applicable to both large and
small-‐scale enterprises,” (34) yet does not explain how the issues, which emerged in
the pilots are actually resolved. The new version includes “some sub-‐themes that
specifically address some of the concerns specific to small-‐scale producers, as well as
different types of indicators,” which as discussed previously provide for the a variety
of indicator types (performance, practice, and target-‐based) and aggregated
weighing systems. In such a way, the new version, theoretically, addresses the
necessity to incorporate alternative metrics for the small-‐holder in assessing its
sustainability. However, the underlying lack of data – in the form of a
shared/accessible best-‐practice and local-‐sectoral global knowledge database –
renders the new version of SAFA victim to the same deficit exposed in the original
one; that is, the crucial dimension of access to data perpetually plagues the ultimate
effectiveness of SAFA, specifically for the small-‐holder.
SAFA “HOT-‐SPOT” #6: FUTURE UTILITY?
Insofar as the stated objective of the test version of SAFA presented a tool to assess
an entity’s sustainability through self-‐evaluation and, consequently, for internal use,
Percentage of Small-‐Holders in Pilot Phase
Small-‐Holders
Other
75
many of the pilot studies raised the question of the added benefit of the given tool.
All but one of the pilot studies confirmed the use of the SAFA assessment to be
merely that of an “internal” one given the noted shortcomings identified throughout
the study and suggested areas for improvement (the only one to claim SAFA “ready
as is” happened to be the only SAFA to not score below good/best despite the
incorrect use of indicators throughout the pilot). That said, nearly all entities
questioned the added benefit – and, therefore, future utility – of SAFA as a tool for
assessing sustainable performance given its purely internal aim, considering that
other existing schemes provide for legitimate business-‐to-‐business (B2B) and
business-‐to-‐consumer (B2C) relationships and certification schemes. An organic
cereal and fruit processer in Germany claimed SAFA “would be good if B2C schemes
were set up,” since otherwise companies “will opt for other approaches.” The notion
of external use provides an incentive for users of the tool, as underlined by the
feedback found in the pilot studies. However, as another entity expressed, “If B2C,
SAFA runs the risk of ‘misusing’ the UN name for its own ends,” as SAFA represents
an “honesty based assessment.” Calling upon the need for an external auditor or a
third-‐party review of the SAFA assessment, most pilots concurred such suggestion in
fear of potential “green-‐washing” strategies. In such a way, good governance
improvement would facilitate the legitimate use of SAFA, providing the opportunity
for B2B sustainability relationships vis-‐à-‐vis transparent reporting.
The limited intended use of the SAFA Guidelines, in turn, restricted the potential
functionality of it, confining its use – and future utility – within a quasi-‐incentiveless
sphere. As one entity noted, the bottom-‐up approach of SAFA risks to “sit unused,”
given its “weak influence on decision-‐makers.” The given entity further called upon
sector-‐specific SAFAs. Considering the question of utility, the new version of SAFA
evidences the incorporation of such suggestions, allowing for two forms of
“reporting” in its final step. The introduction of a “critical review” foresees two
potential uses: Level 1 (internal use) and Level 2 (B2B or B2C). The second level, as
suggested, requires an external verification process, allowing for increased
legitimacy and, in turn, increased utility.
76
However, one may recall the “5W’s” mentioned in our discussion of transnational
regulatory schemes here, inquiring as to who does the monitoring and upon what
grounds are metrics selected for compliancy evaluation. Stressing the role of
legitimacy, the effectiveness of SAFA can largely be located within the dimension of
effective governance. In other words, simply looking at the board is not enough,
whereas it is in the very checks and balances of the board that makes all the
difference. In its current form, SAFA represents a complicated process of folding an
evaluation of effectiveness within an-‐other evaluation of effectiveness, creating a
potentially never-‐ending loop of evaluation(s), with the tendency to disconnect the
legitimacy, transparency, and overall effectiveness of the regulatory scheme from
the regulated entity in question. As SAFA aims to simplify and converge regulatory
schemes, it nevertheless faces a perpetual crisis of inter and intra-‐fragmentation.
The “sustainability,” then, of SAFA will lie in its ability to incorporate governance
within its structural, operational, and functional framework for effective policy-‐
making.
77
CONCLUSION
As discussed, the notion of “sustainability” marks a socially, politically, economically,
culturally, and linguistically charged metric for negotiating what lies at stake in
transnational regulation—a space and place for enacting regulatory
transformation(s). The SAFA tool set forth by the FAO represents a comprehensive
effort to systematically address the issue of sustainable development across sectors,
nations, and markets, paralleling the interdependencies of the supply chain with the
contingencies of transnational regulatory relationships. In such a way, FAO’s efforts
evidence a strategic approach of embedding voluntary standards within the
transnational regulatory process, subverting conventional schemes of competition
into ones of potential cooperation. The convergence of conventional domains of
conflict – such as trade and environment protection, or e-‐commerce and data
privacy – speak to the growing integration between regulatory regimes, enabling an
axis of cooperation over competition within the transnational sphere.
Thus, in a modern world plagued by extreme poverty, exponential population
growth, depleting natural resources, and rising global temperatures, the urgency of
sustainable development becomes ever more important, calling upon effective
regulation across multiple forms of governance and among regulatory relationships.
Insofar as the definition of “sustainability” – a slippery linguistic construction
reflecting a given socio-‐political agenda – perpetually escape the interpreter, it
becomes difficult to say if an entity performs sustainably or not, as one never seems
to “know” what sustainability actually means. Furthermore, we have witnessed an
inverse effect in the field of sustainable development; although the amount of
mechanisms, tools, and metrics for sustainability have increased over time,
sustainability in and of itself continually diminishes. FAO’s SAFA platform represents
an attempt to “make sense” of sustainability by virtue of disentangling its
constituent parts and re-‐constructing a comprehensive framework for its
constitution. Following in the footsteps of the efforts set forth by the FAO (and its
hopeful path ahead), this paper has sought to underscore the need for an integrated
approach towards sustainability reporting, encouraging the use of objective
78
evaluation mechanisms, sound ex ante standard-‐setting, ex post indicator-‐selection,
and comprehensive impact assessments in the evaluation of regulatory
effectiveness. Grounding the viability of sustainable assessment in the ability to
integrate its frameworks within the crucial dimension of governance and along the
axis of regulatory relationships, the future of sustainable development rests, then,
within this very crux—a space to negotiate not only what is sustainability but stake
why it is important.
79
BIBLIOGRAPHY Cafaggi, F. and A. Renda. 2014, Measuring the effectiveness of private regulatory organizations, forthcoming. Cafaggi, F. 2011a, New Foundations of Transnational Private Regulation, Vol. 38, “Journal of Law and Society,” EUI WP. Cafaggi, F. 2012b, The Enforcement of Transnational Regulation. Ensuring Compliance in a Global World, Edward Elgar Publishers. Cafaggi, F. 2013a, A comparative analysis of transnational private regulation: legitimacy, quality, effectiveness and enforcement, HILL Report, 2013. (available at http://www.ssrn.org). European Commission (2001), “Mandelkern Group on Better Regulation,” European Commission Communication, 13 November. European Commission (2014), “Regulatory Fitness and Performance Programme (REFIT): State of Play and Outlook,” COM (2014) 368 Final, European Commission Communication, 18 June.
FAO. 1989. Sustainable Development and Natural Resources Management. Twenty-‐Fifth Conference, Paper C 89/2 -‐ sup. 2. Rome.
FAO. 2012. Sustainability Assessment Food and Agricultural Systems Guidelines (Test Version 1.0). Food and Agriculture Organization of the United Nations, Rome (available at http://www.fao.org/fileadmin/templates/nr/sustainability_pathways/docs/SAFA_Guidelines_12_June_2012_final_v2.pdf). FAO. 2013. Sustainability Assessment Food and Agricultural Systems Guidelines. Food and Agriculture Organization of the United Nations, Rome (available at http://www.fao.org/fileadmin/templates/nr/sustainability_pathways/docs/SAFA_Guidelines_Final_122013.pdf). Koritza, L. 2013, Certification showing it’s worth the investment, Sustainable Brands. Web. 14 November 2014. (available at http://www.sustainablebrands.com). SAFA: A Long-‐awaited Step Forward to Understanding and Acting Upon Sustainability in the Food and Agriculture Sectors, 2013. Web. 14 November 2014. (available at www.sciforum.net/conference/wsf2/paper/959/download/pdf).